JUVENILE HEMOCHROMATOSIS GENE (HFE2A),
EXPRESSION PRODUCTS AND USES THEREOF



This patent application claims priority to US provisional applications
Serial No. 60/462,867, filed 15 April 2003, Serial No. 60/488,607, filed
18 July 2003, and Serial No. 60/498,458, filed 28 August 2003, the
disclosures of which are hereby incorporated by reference in their entirety.

FIELD OF THE INVENTION 

The present invention relates generally to the field of iron metabolism 
diseases, especially juvenile hemochromatosis, to a gene associated therewith, 
and to methods of using this gene, including expression products thereof, for the 
screening and identification of agents useful in the treatment of diseases of iron 
metabolism, including methods of such treatment.

BACKGROUND OF THE INVENTION 

At least 4 iron-overload disorders labeled hemochromatosis have been 
identified on the basis of clinical, biochemical, and genetic characteristics. 
Hemochromatosis type 1 is classic hemochromatosis (sometimes designated 
"HFE") (see OMIM Number 235200; Online Mendelian Inheritance in Man, 
OMIM™. Johns Hopkins University, Baltimore, MD, at 
www.ncbi.nlm.nih.gov/omim/), an autosomal recessive disorder, which is 
caused by mutation in a gene designated HFE on chromosome 6p21.3. The
medical disorder called juvenile hemochromatosis (sometimes called "JH" or
"juvenile haemochromatosis") is also known as hemochromatosis type 2
("HFE2"). Hemochromatosis type 3 (HFE3; OMIM 604250), an autosomal
recessive disorder, is caused by mutation in the gene encoding transferrin
receptor-2 (TFR2; OMIM 604720), which maps to 7q22. Hemochromatosis
type 4 (HFE4; OMIM 606069), an autosomal dominant disorder, is caused by
mutation in the SLC11A3 gene (OMIM 604653), which encodes ferroportin
and maps to 2q32.

In some families juvenile hemochromatosis shows linkage to
chromosome 1q21, whereas in others it is caused by mutation in the gene
encoding hepcidin antimicrobial peptide, which maps to chromosome 19q13.
The two forms of juvenile hemochromatosis (HFE2) are tentatively designated
HFE2A and HFE2B, respectively. The present invention relates to the genetic
basis of HFE2A, the form of JH linked to chromosome 1q21.

Juvenile hemochromatosis (JH) differs from typical hereditary
hemochromatosis. While HFE has a prevalent male expression, JH affects
both sexes equally. JH involves iron accumulation, which begins early in life
and typically causes clinical symptoms before the age of 30 years. JH is a
more severe disease than typical hereditary hemochromatosis, with JH
showing hypergonadotropic hypogonadism, heart failure, arrhythmias and/or
cardiomyopathy as frequent features. If untreated, the disease is lethal
because of cardiac and other complications.

Identification of 1q21 as the chromosomal location of HFE2A was first
reported in Roetto et al., Am. J. Hum. Genet. 64:1388-1393 (1999), but this
locus did not correspond to the chromosomal location of any known gene
involved in iron metabolism.

The present invention provides identification of the hereditary basis for
HFE2A, thereby facilitating development of more potent agents for treating
diseases of iron metabolism. Administration of the HFE2A gene or protein
itself may be therapeutic. Alternatively, the underlying genetic mutation
identifies a novel therapeutic target for treating diseases of iron metabolism.
This therapeutic target can be used to identify and discover more effective
therapeutic agents. Diagnostic compounds, kits and methods using HFE2A
are also included which may be used to diagnose JH as well as to diagnose
and predict onset and severity of adult hemochromatosis and to distinguish
between types of iron metabolism disorders.

BRIEF SUMMARY OF THE INVENTION 

The invention relates to the discovery that juvenile hemochromatosis
(hemochromatosis type 2A, or HFE2A) is caused by a mutation in a human
gene found at 1q22 having the nucleotide sequence as set out in SEQ ID
Nos. 1-9, and/or the corresponding amino acid sequences as set out in SEQ
ID Nos. 10-12. The gene and the protein are referred to herein as HFE2A
and also as hemojuvelin, these words referring to the gene, the gene product
and the protein expressed therefrom, unless the context specifies otherwise.
The gene has also been named HFE2, by which is meant the form of JH caused
by the gene at 1q21. This naming protocol is not essential to the invention
claimed herein.

In one aspect, the present invention relates to a method for identifying
an agent that modulates hemojuvelin, comprising contacting a test compound
with hemojuvelin and determining a change in hemojuvelin activity due to the
compound, thereby identifying a modulator of the type being sought. The
modulator may be a drug-like small molecule, an antibody, an antisense
nucleic acid, a ribozyme or any other compound which changes the activity of
the gene or protein.

In one aspect, the present invention relates to a method for identifying an
agent that modulates HFE2A gene expression, comprising contacting a test
compound with a polynucleotide comprising an HFE2A gene under
conditions promoting expression of said gene (i.e., conditions wherein the
polynucleotide is being expressed) and determining a change in expression
due to the presence of the test compound. This identifies the test compound
as such a modulator. In a preferred embodiment, this change in expression is
detected as a change in transcription of the gene, preferably where the gene
is a mammalian gene and most preferably where the gene is expressed by a
cell.






WO 2004/092405 PCT/CA2004/000522 

In another aspect, the present invention relates to a method for
identifying an agent that modulates HFE2A gene expression, comprising
contacting a test compound with a reporter gene operably linked to a
promoter sequence and determining a change in expression of the reporter
gene due to the test compound, thus identifying a modulator of the construct.
Preferably, this is modulation of transcription of said reporter gene operably
linked to an HFE2A promoter, most preferably a mammalian HFE2A promoter,
such as a mouse, rat or human HFE2A promoter.

In a further aspect, the present invention relates to a method for
treating and/or preventing a disorder in an animal comprising administering to
an animal afflicted therewith, or at risk of developing said disorder, a
therapeutically effective amount of an HFE2A modulator. In a preferred
embodiment, the HFE2A modulator exhibits modulating activity in a method of
the invention, most preferably wherein said agent was first identified as an
HFE2A modulator using said method and was not otherwise known to have
such activity.

In a yet further aspect, the present invention relates to a method to
diagnose individuals afflicted with or at risk of developing HFE2A or a related
disorder comprising determining the nucleic acid sequence of the HFE2A
gene in said individual wherein a mutation or polymorphism of said gene
identifies said individual as an individual afflicted with or at risk of developing
HFE2A or a related disorder.

The present invention also contemplates a method for identifying a
compound capable of modulating an HFE2A activity, comprising: (a) contacting
a cell which expresses HFE2A with a test compound; and (b) assaying the
ability of the test compound to modulate the transcription of an HFE2A nucleic
acid or the activity of HFE2A polypeptide, thereby identifying a compound
capable of modulating an HFE2A activity. In a preferred embodiment of such
method, the compound is an anti-HFE2A polypeptide antibody, a ribozyme or
an antisense HFE2A nucleic acid molecule. In one preferred embodiment
thereof, the compound is an HFE2A ribozyme.

The present invention also encompasses an isolated polynucleotide or
isolated polypeptide of the genes disclosed herein.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic of the LOC148738 gene structure. The
LOC148738 gene has four exons and five transcripts. The five transcripts
code for 3 proteins of 200 AA, 313 AA and 426 AA. Exon 2 was predicted by
Ensembl from cDNA sequence AK098165.1. The acceptor site for Exon 2 is
inferred from the genomic sequence. Exons 3a and 3b have the same
acceptor site but different donor sites. Untranslated sequence is white,
translated sequence is black. The transcripts are listed from longest to
shortest.

Figure 2 is an alignment of the human, mouse and rat promoter
sequences for HFE2A. The transcriptional start site of the human sequence
begins at position 1514 of this alignment and exon 1 ends at position 1676.
Three repetitive sequences are present within the human sequence: a
(TAGA)n at position 58 to 129, an Alu repetitive element at position 324 to 603
and a (GGAA)n at position 604 to 702. These sequences are not present in
the rodent sequences and possibly define the promoter boundary for the
gene. Sequence homology between species is lost 5' of position 844 of this
alignment, about 670 bp 5' of the transcriptional start. A putative
transcriptional initiation site (TATA) is conserved in all three species, at 27 to
24 bp 5' of the transcriptional start in the human.

Figure 3 shows a Protein Sequence Alignment of the Human LOC148738
426 aa Gene Product (Protein3) and human paralogs: Human Repulsive
Guidance Molecule (Human RGM) (NCBI: NP_064596.1) on Chromosome 15
and ENSESTP00000023393 (Ensembl) identified on Chromosome 5.
Comparison of human paralogs.

Figure 4 shows a Sequence Alignment of LOC148738 sequences
using clustalx. Human (Protein3), Mouse (Ensembl:
ENSMUSESTP00000016634), Rat (translated from NCBI: AK098165.1;
though annotated as human sequence it is 99.9% identical to Rat genomic
sequence and only 86.6% identical to human genomic sequence) and Fugu
(Ensembl: SINFRUP00000138308).

Figure 5 shows a Protein Sequence Alignment of the Human LOC148738
gene product and Chicken Repulsive Guidance Molecule (RGM). Human
(Protein3), Chicken_RGM (Translation of NCBI: AY128507.1). A = Signal
Peptide, B = RGD, C = N-Glycosylation site, D = Partial von Willebrand Type
D Domain, E = GPI modification, Arrow = Possible cleavage site.

Figure 6A shows a summary of the high degree of similarity between
homologs of several species.

Figure 6B shows the high degree of similarity between genes similar to
HFE2A in different species.

Figure 7(a) shows structural features in the 426 aa translated open reading
frame of LOC148738. RGD - cell attachment tri amino acid motif, vWF -
Partial von Willebrand factor type D domain and TM - Transmembrane
domain. The arrows mark a possible proteolytic site between positions 172
and 173, a furin cleavage site between positions 335 and 336 and a predicted
GPI modification site at position 400. Figure 7(b) shows the prediction of the
signal peptide cleavage site in the 426 aa translation of HFE2A using SignalP
(Henrik Nielsen et al., Protein Engineering, 10, 1-6 (1997)). The peptide
cleavage site is predicted to be between S35 and Q36. Figure 7(c) shows the
prediction of transmembrane regions in the 426 aa translation of HFE2A using
TMHMM. The transmembrane region is from residues 403-425. Figure 7(d)
shows the identification of a partial von Willebrand Type D domain (57%)
using NCBI Conserved Domain Search (LOC148738 = HFE2A). Figure 7(e)
shows a possible proteolytic cleavage site (arrow) between residues 172D and
173P identified by sequence comparison with Chicken Repulsive Guidance
Molecule (RGM).

Figure 8 shows the nucleic acid and amino acid sequence of the 426 aa open
reading frame translated from Transcript4 of LOC148738 (Protein3). This
protein is identical to the Ensembl protein ENSP00000304614 from the
predicted Ensembl transcript ENST00000306561. Box, initiating codon;
triangle, potential signal peptide cleavage site; dots, RGD site; cross,
predicted O-glycosylation sites (NetOGlyc 2.0 predicted); diamonds, predicted
N-glycosylation sites (NetNGlyc 1.0 predicted); arrow, putative cleavage site
from comparison with chicken RGM; double arrow, furin cleavage site;
square, predicted GPI anchor modification site; asterisk, stop codon.

Figure 9(a) - (e) shows family pedigrees used to identify the genetic
mutation causing juvenile hemochromatosis.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a gene and its corresponding protein
related to diseases of iron metabolism, referred to as hemojuvelin or HFE2A.
Hemojuvelin and HFE2A each refer to the gene, the gene product and the
protein expressed therefrom, unless the context specifies otherwise.

In accordance with the present invention, the human HFE2A gene is
disclosed. This gene, when mutated, results in juvenile hemochromatosis
(hemochromatosis type 2A). The gene has also been named HFE2, by which
is meant the form of JH caused by the gene at 1q21. The gene product/protein
is also sometimes called "hemojuvelin." This naming protocol is not essential
to the invention claimed herein.

In one aspect, the invention relates to the nucleic acid sequence for
HFE2A, including the genomic sequence, mRNA or cDNA, polymorphic,
allelic, isoforms (adult, neo-natal, etc.) and mutant forms thereof, and nucleic
acid constructs of the gene, including vectors, plasmids and recombinant cells
and transgenic organisms containing or corresponding to HFE2A (or
knock-outs thereof). Such nucleic acid sequences are set forth in SEQ ID
NOs: 1-9 and 13-22.

In another aspect, the invention relates to the gene product of HFE2A,
sometimes called herein hemojuvelin or HFE2A polypeptide, including the
polypeptide, protein, and amino acid sequence, and the polymorphic, allelic,
isoforms (adult, neo-natal, etc.) and mutant forms thereof, mRNA or other
transcripts of HFE2A and recombinant cells and transgenic organisms
wherein this polypeptide or a polypeptide corresponding thereto is expressed.
Such amino acid sequences are set forth in SEQ ID NOs: 10-12 and 23-28.

In one aspect of the present invention, HFE2A is incorporated into a
screening assay whereby compounds (potential therapeutic agents) are
tested to determine if they modulate HFE2A gene expression activity, thereby
identifying potential therapeutic agents.

In accordance with the foregoing, the present invention relates to a
method for identifying an agent that modulates HFE2A gene expression,
comprising:

a) contacting a test compound with a cell expressing an HFE2A gene; and

b) determining a change in expression of said HFE2A gene as a result
of said contacting,

wherein a change in expression identifies said test compound as an
agent that modulates HFE2A gene expression.

In a preferred embodiment, this change in expression is detected by
detecting a change in transcription of said gene, such as where there is a
decrease or increase in transcription. In another preferred embodiment, the
HFE2A gene is a mammalian HFE2A gene, most preferably where the
mammal is a mouse, rat or human.
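
By way of illustration only, a change in transcription of this kind could be quantified from quantitative PCR data. The sketch below assumes the comparative threshold-cycle (delta-delta Ct) calculation, a reference (housekeeping) transcript, and a two-fold cut-off; none of these is specified in the present description, and the function names are hypothetical.

```python
# Hypothetical sketch: quantify a change in HFE2A transcription between
# compound-treated and untreated cells using the delta-delta Ct calculation.
# The qPCR read-out, the reference transcript and the 2-fold cut-off are
# assumptions made for illustration; they are not taken from the description.

def fold_change_ddct(ct_target_treated, ct_ref_treated,
                     ct_target_control, ct_ref_control):
    """Relative expression of the target transcript (treated vs. control)."""
    delta_treated = ct_target_treated - ct_ref_treated    # normalise treated
    delta_control = ct_target_control - ct_ref_control    # normalise control
    return 2.0 ** -(delta_treated - delta_control)


def classify_change(fold, threshold=2.0):
    """Label a fold change as an increase, a decrease, or no change."""
    if fold >= threshold:
        return "increase"
    if fold <= 1.0 / threshold:
        return "decrease"
    return "no change"


if __name__ == "__main__":
    # Illustrative Ct values only (not experimental data).
    fold = fold_change_ddct(24.0, 18.0, 26.0, 18.0)
    print(fold, classify_change(fold))
```

A test compound giving an "increase" or "decrease" under such a scheme would be a candidate modulator of HFE2A gene expression, subject to confirmation by replicate measurements.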

In another preferred embodiment, the gene whose modulation is to be
determined is present in a mammalian cell, preferably a recombinant cell,
including where the cell is a macrophage, inflammatory cell, liver cell,
hepatocyte, intestinal cell, hematopoietic cell, pancreatic cell, skeletal muscle
cell or a cell of the nervous system.

In another aspect, the present invention relates to a method for
identifying an agent that modulates the activity of an HFE2A polypeptide,
comprising:

a) contacting a test compound with an HFE2A polypeptide under
conditions supporting an activity of said polypeptide; and

b) determining a change in said activity of said polypeptide as a result
of said contacting,

wherein a change in the activity identifies said test compound as an
agent that modulates the activity of an HFE2A polypeptide.

In preferred embodiments of such methods, the HFE2A-encoded
polypeptide is a mammalian HFE2A polypeptide, most preferably from mouse,
rat or human.

In other preferred embodiments, the modulation is a decrease or
increase in said biological activity. In a highly useful embodiment, the
polypeptide is in, or is part of, a cell, preferably a mammalian cell, most
preferably a recombinant cell, including where the cell is a macrophage,
inflammatory cell, liver cell, hepatocyte, intestinal cell, hematopoietic cell,
pancreatic cell, skeletal muscle cell or a cell of the nervous system. In other
preferred embodiments, the cell has been engineered to contain said
polypeptide, such as by genetic engineering, especially where said cell does
not express said polypeptide absent said engineering.

In an additional preferred embodiment, the polypeptide is encoded by a
polynucleotide having the sequence selected from the group consisting of
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8 and 9, most preferably wherein said
polypeptide comprises an amino acid sequence selected from the sequence
of SEQ ID NO: 10, 11 and 12.

The present invention also relates to a cell line comprising one or
more of said polynucleotides of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8 and 9,
including recombinant forms of the polynucleotides, as well as a cell line
comprising one or more of the polypeptides of SEQ ID NO: 10, 11 and 12,
preferably a recombinant form of the polypeptide. In another embodiment, the
present invention relates to use of such a cell line in a method for identifying
compositions which modulate expression or activity of hemojuvelin.

In a further aspect, the present invention relates to a method for
treating and/or preventing a disorder in an animal comprising administering to
an animal afflicted therewith, or at risk of developing said disorder, a
therapeutically effective amount of an HFE2A modulator. In a preferred
embodiment, the HFE2A modulator exhibits modulating activity in a method of
the invention, most preferably wherein said agent was first identified as an
HFE2A modulator using said method and was not otherwise known to have
such activity. In a preferred embodiment of such method, the HFE2A
modulator is a selective HFE2A agonist, or a selective HFE2A antagonist,
including pharmaceutically acceptable salts thereof, and/or any combinations
of these, including where these are in a pharmaceutically acceptable carrier.
In an embodiment, said disorder is a disease of iron metabolism, including an
iron overload disorder and/or an iron deficiency disorder. Such disease of iron
metabolism may also include one or more of Type I diabetes, Type II diabetes
and insulin resistance.

In a yet further aspect, the present invention relates to a method to
diagnose individuals afflicted with or at risk of developing HFE2A or a related
disorder comprising determining all or part of the nucleic acid sequence of the
HFE2A gene in said individual wherein a mutation or polymorphism of said
gene identifies said individual as an individual afflicted with or at risk of
developing HFE2A or a related disorder. In another such embodiment, the
present invention relates to a method to diagnose individuals affected by or at
risk of developing HFE2A or a related disorder comprising determining all or
part of the amino acid sequence of HFE2A polypeptide in said individual
wherein a mutation or polymorphism of said gene identifies said individual as
an individual affected by or at risk of developing HFE2A or a related disorder.

The present invention also contemplates a method for identifying a
compound capable of modulating an HFE2A activity, comprising: (a) contacting
a cell which expresses HFE2A with a test compound; and (b) assaying the
ability of the test compound to modulate the transcription of an HFE2A nucleic
acid or the activity of HFE2A polypeptide, thereby identifying a compound
capable of modulating an HFE2A activity. In a preferred embodiment of such
method, the compound is an anti-HFE2A polypeptide antibody, a ribozyme or
an antisense HFE2A nucleic acid molecule. In one preferred embodiment
thereof, the compound is an HFE2A ribozyme. In another embodiment, the
compound is a small organic molecule.

The present invention also encompasses an isolated polynucleotide
comprising a polynucleotide having a nucleotide sequence with at least 60
percent identity to a sequence selected from the group consisting of SEQ ID
NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 19, 20, 21 and 22. In other embodiments thereof,
said percent identity is at least 70 percent, more preferably at least 78
percent, even more preferably at least 90 percent, yet more preferably at least
95 percent, still more preferably at least 98 percent, and most preferably
where the nucleotide sequence is a member selected from the group
consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 19, 20, 21 and 22.
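
For illustration, percent identity between two sequences that have already been aligned might be computed as sketched below. The choice of denominator (all aligned columns that are not gap-only) is an assumption, since conventions for this calculation differ and none is fixed by the passage above; the function name is hypothetical. The two 21-nucleotide sequences are the wild-type and mutant forms listed for c.296G>T in Table 1.

```python
# Hypothetical sketch: percent identity of two pre-aligned sequences.
# Assumption: identity = matching columns / columns that are not gap-only;
# other conventions (e.g. dividing by the shorter sequence length) exist.

def percent_identity(aligned_a, aligned_b, gap="-"):
    """Return percent identity of two equal-length aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue                      # ignore columns that are all gaps
        compared += 1
        if a == b:
            matches += 1                  # identical residues in this column
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


if __name__ == "__main__":
    wild_type = "acctgccgcgGggacctcgcc"   # SEQ ID NO: 31 (Table 1)
    mutant = "acctgccgcgTggacctcgcc"      # SEQ ID NO: 32 (Table 1)
    identity = percent_identity(wild_type, mutant)
    print(round(identity, 2), identity >= 60.0)
```

Over this short window the two sequences differ at one of 21 positions, so they exceed each of the identity thresholds recited above except the final requirement of an identical sequence.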

In another such embodiment, the present invention specifically
encompasses an isolated polypeptide comprising a polypeptide having an
amino acid sequence with at least 90 percent identity to an amino acid
sequence selected from the group consisting of SEQ ID NO: 10, 11, 12, 23,
24, 25, 26, 27 and 28, preferably at least 95 percent, more preferably at least
98 percent and most preferably where the amino acid sequence is selected
from the group consisting of SEQ ID NO: 10, 11, 12, 23, 24, 25, 26, 27 and
28.

The present invention also encompasses embodiments where the
agents identified by the methods of the invention, or useful in the methods of
the invention, may be present in the form of a composition for treating a
disease of iron metabolism comprising a therapeutically effective amount of a
polypeptide of the invention, or a pharmaceutically active fragment thereof, in
a pharmaceutically acceptable carrier. In accordance therewith, the present
invention specifically contemplates a method for treating a disease of iron
metabolism comprising administering to a patient in need thereof a
therapeutically effective amount of such a composition. In an embodiment
thereof, the disease is juvenile hemochromatosis.

In a further aspect, the present invention relates to a method for
identifying an agent for the treatment of a disease of iron metabolism,
comprising:

a) administering to an animal an agent found to have activity using a
method as disclosed herein, and

b) detecting in said animal a change which is indicative of a change in
iron metabolism due to said administering,

thereby identifying an agent for the treatment of a disease of iron
metabolism.

Preferably, the animal is a mammal, such as a human being. In specific
embodiments, the change in iron metabolism may be a change in storage tissue
distribution of iron.

In a preferred embodiment of the invention, the compound identified
which modulates HFE2A is selective for HFE2A ahead of related genes or
proteins, and therefore does not significantly modulate the activity of such other
related genes or proteins by direct interaction.

In a further aspect, the present invention relates to a method for treating
a condition in an animal afflicted with a disease of iron metabolism comprising
administering to said animal an effective amount of an agent first identified by an
assay method of the invention. Preferably, said animal is a human patient, such
as a patient afflicted with a chronic disease of iron metabolism.

In a further aspect, the compounds identified by the assays disclosed
herein, and the methods of treating patients with such compounds, are applied
to alternative or additional indications beyond diseases of iron metabolism,
which are found to be treatable by modulating HFE2A activity.

In a further aspect, the present invention relates to compounds that
modulate HFE2A that may now be identified by those skilled in the art. These
compounds include antibodies, antisense compounds, gene therapy vectors
and proteins, and small molecule organic compounds. The invention also
comprises the use of these compounds for treatment of diseases associated
with aberrant HFE2A activity. In one embodiment, the compounds are used for
treatment of a disease of iron metabolism.

In another aspect, the present invention relates to a method for
producing a product comprising identifying an agent according to the process
of the invention wherein said product is the data collected with respect to said
agent as a result of said process and wherein said data is sufficient to convey
the chemical structure and/or properties of said agent.






In another aspect, the present invention relates to diagnostic and
pharmacogenomic compositions, kits and methods which identify the
presence or absence in a patient of one or more mutations in a polynucleotide
or a polypeptide corresponding to SEQ ID NOs: 1-12, including those specific
mutations identified at SEQ ID NOs: 14, 16, and 18, and other mutations that
may be subsequently discovered to be relevant. This embodiment is useful in
diagnosing the presence or absence of juvenile hemochromatosis. It is also
useful for diagnosis of adult-onset hemochromatosis, and for the prediction of
onset and severity thereof.

Figure 1 illustrates alternative splice variants that are generated during
transcription of the HFE2A gene. For convenience, the nucleotide sequences
of the coding exons are set out as follows:

SEQ ID NO.    Description
1             Sequence of Exon 1
2             Sequence of Exon 2
3             Sequence of Exon 3a
4             Sequence of Exon 3b
5             Sequence of Exon 4

The alternative splice variants generated from the gene produce at
least 5 transcripts. The nucleotide sequence of each of transcripts 1 to 4,
corresponding to a cDNA sequence (or a processed mRNA sequence if T is
converted to U), is set out as follows:

SEQ ID NO.    Description
6             Wild-Type Human HFE2A putative cDNA - Transcript 1 (exon 1, 4)
7             Wild-Type Human HFE2A putative cDNA - Transcript 2 (exon 1, 3a, 4)
8             Wild-Type Human HFE2A putative cDNA - Transcript 3 (exon 1, 3b, 4)
9             Wild-Type Human HFE2A putative cDNA - Transcript 4 (exon 1, 2, 3b, 4)

Each of these four transcripts of the HFE2A gene may be translated
into an HFE2A polypeptide. Transcripts 1 and 2 generate the same full-length
protein, hence there are 3 full-length proteins of 200 amino acids (a.a.), 313
a.a. and 426 a.a., respectively. The amino acid sequence generated from
each transcript is set out as follows:

SEQ ID NO.    Description
10            Wild-Type Human HFE2A Polypeptide - Protein 1 (Transcripts 1 & 2)
11            Wild-Type Human HFE2A Polypeptide - Protein 2 (Transcript 3)
12            Wild-Type Human HFE2A Polypeptide - Protein 3 (Transcript 4)

The longest cDNA sequence (Transcript 4, SEQ ID NO. 9) and the
longest protein sequence (Protein 3, SEQ ID NO. 12) have been used as the
basis for the sequence numbering convention used in the instant
specification.
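
The relationship between transcripts, exons and proteins set out above can be summarised, purely as an illustrative sketch, in a small lookup structure. The assignment of 200 a.a. to Protein 1 and 313 a.a. to Protein 2 follows the "respectively" ordering of the text and is flagged as an assumption; the helper names are hypothetical, and the coding length assumes one stop codon per open reading frame.

```python
# Hypothetical sketch of the transcript / exon / protein relationships.
# SEQ ID numbers and exon compositions are taken from the tables above.
# Assumption: Protein 1 = 200 a.a. and Protein 2 = 313 a.a. (inferred from
# the word "respectively"); Protein 3 = 426 a.a. is stated in the figures.

TRANSCRIPTS = {
    "Transcript 1": {"seq_id": 6, "exons": ["1", "4"], "protein": "Protein 1"},
    "Transcript 2": {"seq_id": 7, "exons": ["1", "3a", "4"], "protein": "Protein 1"},
    "Transcript 3": {"seq_id": 8, "exons": ["1", "3b", "4"], "protein": "Protein 2"},
    "Transcript 4": {"seq_id": 9, "exons": ["1", "2", "3b", "4"], "protein": "Protein 3"},
}

PROTEINS = {
    "Protein 1": {"seq_id": 10, "length_aa": 200},
    "Protein 2": {"seq_id": 11, "length_aa": 313},
    "Protein 3": {"seq_id": 12, "length_aa": 426},
}


def transcripts_for(protein_name):
    """List the transcripts that are translated into the named protein."""
    return sorted(name for name, info in TRANSCRIPTS.items()
                  if info["protein"] == protein_name)


def coding_length_nt(length_aa):
    """Length of an open reading frame in nucleotides, including the stop codon."""
    return 3 * (length_aa + 1)


if __name__ == "__main__":
    for protein, info in PROTEINS.items():
        print(protein, transcripts_for(protein), coding_length_nt(info["length_aa"]))
```

Under these assumptions the open reading frame of Transcript 4 spans 1281 nucleotides, which is consistent with the cDNA positions (up to c.1207) used for the variations described below.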

Aspects of the sequence of HFE2A have been previously published in
various sources, including the following: Homo sapiens Official Gene Symbol
and Name: LOC148738; NCBI LocusID: 148738; Other Names:
ENSG00000168509 (Ensembl v. 12.31.1), NM_145277 (RefSeq); Genomic
Location: 142000393 - 142004657 bp on chromosome 1 (Assembly: NCBI 31
assembly of the human genome, Freeze Date: November 2002) (Ensembl
Version for reference sequences: Ensembl v. 12.31.1). An incomplete
fragment of HFE2A is contained in PCT publication WO 02/051438 where it
was listed as RGM3 and implicated as a Repulsive Guidance Molecule in
neuronal development (3 amino acids, 9 nt missing from 5' coding end).

A positional cloning strategy, set out in Example 1 below, was used to
confirm that chromosomal region 1q21 was highly associated with juvenile
hemochromatosis. Candidate genes were investigated to determine if
mutations were present. The HFE2A gene described herein was determined
to contain a variety of deleterious mutations as follows:



Table 1.

Mutation                     Type   Sequence
c.296G>T, p.G99V             WT     acctgccgcgGggacctcgcc (SEQ ID NO: 31)
                             MT     acctgccgcgTggacctcgcc (SEQ ID NO: 32)
c.842T>C, p.I281T            WT     gctgcctacaTtggcacaact (SEQ ID NO: 13)
                             MT     gctgcctacaCtggcacaact (SEQ ID NO: 14)
c.959G>T, p.G320V            WT     tgtgttggggGgtgccctcca (SEQ ID NO: 15)
                             MT     tgtgttggggTgtgccctcca (SEQ ID NO: 16)
c.1079delC, p.C361fsX366     WT     tacttccattCctgtgtcttt (SEQ ID NO: 17)
                             MT     tacttccatt ctgtgtcttt (SEQ ID NO: 18)
c.976C>T, p.R326X            WT     tccaagtcagCgactctctcg (SEQ ID NO: 29)
                             MT     tccaagtcagTgactctctcg (SEQ ID NO: 30)

Table 1 shows genetic variations identified in HFE2A (ENSE00001277351).
The cDNA position of the mutation is represented by the number in the first
column (e.g., c.296). The mutations are numbered starting from the initiating
ATG codon in ENST00000306561 (transcript 4) (SEQ ID NO. 9). For each
variation ten bases of sequence on either side are presented, with the wildtype
(WT) and mutant (MT) base capitalized.
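
As a worked illustration of this numbering convention, the codon number and amino acid change for each single-base substitution in Table 1 can be recovered from the cDNA position and the 21-base window alone. The sketch below assumes that position 1 is the A of the initiating ATG and that the capitalized base sits at the eleventh position of each window, as stated above; the function names are hypothetical, and the codon table is a deliberately partial extract of the standard genetic code covering only the codons that occur here.

```python
# Hypothetical sketch: derive the protein-level change (p. notation) from the
# cDNA position (c. notation) and the flanking sequences given in Table 1.
# Assumes c.1 is the A of the initiating ATG and that the variant base is at
# index 10 (ten flanking bases on either side).

# Partial extract of the standard genetic code; "X" denotes a stop codon.
CODON_TABLE = {"GGG": "G", "GTG": "V", "ATT": "I", "ACT": "T", "CGA": "R", "TGA": "X"}


def codon_number(cdna_pos):
    """1-based codon number containing a 1-based cDNA position."""
    return (cdna_pos - 1) // 3 + 1


def codon_from_flank(flank, cdna_pos, center=10):
    """Extract the codon containing the variant base from a 21-base window."""
    offset = (cdna_pos - 1) % 3          # 0, 1 or 2: position within the codon
    start = center - offset
    return flank[start:start + 3].upper()


def protein_change(cdna_pos, wt_flank, mt_flank):
    """Return the change in the p.<wild-type><codon><mutant> form."""
    wt_aa = CODON_TABLE[codon_from_flank(wt_flank, cdna_pos)]
    mt_aa = CODON_TABLE[codon_from_flank(mt_flank, cdna_pos)]
    return f"p.{wt_aa}{codon_number(cdna_pos)}{mt_aa}"


SUBSTITUTIONS = [
    (296, "acctgccgcgGggacctcgcc", "acctgccgcgTggacctcgcc"),
    (842, "gctgcctacaTtggcacaact", "gctgcctacaCtggcacaact"),
    (959, "tgtgttggggGgtgccctcca", "tgtgttggggTgtgccctcca"),
    (976, "tccaagtcagCgactctctcg", "tccaagtcagTgactctctcg"),
]

if __name__ == "__main__":
    for pos, wt, mt in SUBSTITUTIONS:
        print(f"c.{pos}", protein_change(pos, wt, mt))
```

The frameshift c.1079delC is omitted from this sketch because its protein-level description depends on downstream sequence that is not contained in the 21-base window.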



Table 2.

Variation                  JH3  JH4  JH5  JH6  JH7  JH8      JH9  JH10  JH11  JH12
c.296G>T, p.G99V           -    -    -    -    -    -        TT   -     -     -
c.842T>C, p.I281T          -    CC   -    -    -    -        -    -     -     -
c.959G>T, p.G320V          TT   -    TT   TT   TT   -        -    TT    GT    TT
c.1079delC, p.C361fsX366   -    -    -    -    -    del/del  -    -     -     -
c.976C>T, p.R326X          -    -    -    -    -    -        -    -     TC    -

Table 2 shows genetic variations identified in Juvenile Hemochromatosis
families. The first column describes the variation, the other columns present
the genotype of each chromosome at the site of mutation of a proband in
each family. JH3, 4, 5, 6, 7, 8, 9, 10 and 12 are homozygous for the variation.
JH11 is a compound heterozygote with two variations.
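
The distinction drawn above between homozygous probands and a compound heterozygote can be illustrated with a short sketch that classifies a proband from genotype calls of the kind shown in Table 2. The function names and the returned labels are hypothetical, and the sketch assumes that two heterozygous variations in the same proband lie on different chromosomes, which genotype strings alone cannot establish.

```python
# Hypothetical sketch: classify a proband from per-variation genotype calls.
# Assumption: two different heterozygous variations are taken to be in trans
# (one on each chromosome); confirming this requires phase or family data.

def split_alleles(genotype):
    """Split 'del/del' on '/', otherwise treat each character as one allele."""
    return genotype.split("/") if "/" in genotype else list(genotype)


def count_mutant(genotype, mutant_allele):
    """Number of chromosomes (0, 1 or 2) carrying the mutant allele."""
    return sum(1 for allele in split_alleles(genotype)
               if allele.upper() == mutant_allele.upper())


def classify_proband(calls):
    """calls: list of (genotype, mutant allele) pairs, one per variation."""
    counts = [count_mutant(genotype, mutant) for genotype, mutant in calls]
    if any(count == 2 for count in counts):
        return "homozygous"
    heterozygous_sites = sum(1 for count in counts if count == 1)
    if heterozygous_sites >= 2:
        return "compound heterozygote"
    if heterozygous_sites == 1:
        return "heterozygous"
    return "no mutant allele"


if __name__ == "__main__":
    print(classify_proband([("TT", "T")]))                # e.g. c.959G>T, TT
    print(classify_proband([("del/del", "del")]))         # c.1079delC
    print(classify_proband([("GT", "T"), ("TC", "T")]))   # c.959G>T and c.976C>T
```

Applied to the two heterozygous genotypes in Table 2 (GT at c.959 and TC at c.976), the sketch returns "compound heterozygote", matching the description of JH11.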

Segregation of the indicated mutations in HFE2A with affected HFE2A
patients has now been confirmed in all affected individuals in the families
identified in Table 2. The variation is not found in 128 European and 37 Greek
control (unaffected) individuals. Thus, the presence of 7 families having one
coding change resulting in a G320V, one family having a unique coding change
resulting in an I281T, and two other families having a premature truncation of
this protein is convincing evidence that mutations in HFE2A (Gene 3
(LOC148738) ENSG00000168509 v.12.31.1) cause Juvenile
Hemochromatosis.

Another variation occurred in control samples (i.e. unrelated persons
not affected with JH). Two of 128 Dutch controls are heterozygous for
c.1207G>A, p.V403I. Since it was heterozygous in these two samples, no JH
phenotype would be expected to be apparent, nor is it found in these
individuals. This variation could be a single nucleotide polymorphism (SNP)
in the Dutch population which is not deleterious. Alternatively, it may be a
potentially causal mutation of JH were it to appear in the homozygous form.
At this point it is just a SNP, but it may be useful for diagnostic or therapeutic
purposes.
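
As a worked illustration of why homozygotes for this variation would rarely be observed, the carrier count above can be converted into an allele frequency. The sketch below is in Python and assumes Hardy-Weinberg proportions, an assumption that is not made in the specification itself; the variable names are illustrative.

```python
from fractions import Fraction

controls = 128       # Dutch control individuals screened (from the text)
heterozygotes = 2    # carriers of c.1207G>A, p.V403I (from the text)

# Each individual contributes two chromosomes; each carrier has one variant allele.
chromosomes = 2 * controls
allele_freq = Fraction(heterozygotes, chromosomes)

# Assumption: Hardy-Weinberg proportions, so homozygotes occur at q squared.
expected_homozygotes = allele_freq ** 2

print(allele_freq, float(allele_freq))
print(expected_homozygotes, float(expected_homozygotes))
```

On these figures the variant allele frequency is 1/128 and the expected homozygote frequency is 1/16384, so the absence of homozygous individuals among 128 controls is uninformative as to whether the homozygous form would be deleterious.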

A significantly conserved genomic DNA promoter nucleic acid
sequence for HFE2A has been identified in human, mouse and rat as
follows:

SEQ ID NO.    Description
19            Promoter sequence - human
20            Promoter sequence - mouse
21            Promoter sequence - rat

Alignment of the promoter sequences in Figure 2 shows the high
degree of conservation between species in the regulatory region. Sequence
conservation is substantially lost about 670 nt 5' to the transcription initiation
site in the human sequence.

Genomic sequence: the full human genomic sequence illustrating the
nucleic acid sequence around HFE2A is set forth at SEQ ID NO. 22. Exons
are indicated in capital letters.

In accordance therewith, a gene useful in the methods of the invention
comprises a polynucleotide corresponding to a polynucleotide having a
nucleotide sequence selected from the group consisting of SEQ ID NO: 19,
20, 21 and 22.

In another aspect, the present invention relates to a method for
identifying an agent that modulates HFE2A gene expression, comprising:

(a) contacting a test compound with a genetic construct comprising a
reporter gene operably linked to an HFE2A promoter under conditions
supporting expression of said reporter gene;

(b) determining a change in expression of said reporter gene as a
result of said contacting,

wherein a change in expression identifies said test compound, or
confirms said test compound, as an agent that modulates HFE2A gene
expression.

In a preferred embodiment thereof, the modulation is a decrease or
increase in expression, especially transcription, of said reporter gene and
includes embodiments where the HFE2A promoter is a mammalian HFE2A
promoter, preferably a mouse, rat or human HFE2A promoter. In a highly
preferred embodiment, the promoter has a nucleotide sequence of SEQ ID
NO: 19.

In a preferred embodiment, the expression of said reporter gene is
determined by measuring the activity of an enzyme, such as where the
reporter gene encodes a protein having a readily measurable enzyme activity.
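
By way of illustration only, step (b) of the reporter method can be reduced to a fold-change comparison between wells treated with a test compound and vehicle-only wells. The Python sketch below assumes replicate enzyme-activity readings are available as plain numbers; the function names, the two-fold threshold and the readings are hypothetical and are not taken from the specification.

```python
from statistics import mean


def fold_change(treated, vehicle):
    """Ratio of mean reporter signal with test compound to mean vehicle signal."""
    return mean(treated) / mean(vehicle)


def call_modulator(treated, vehicle, threshold=2.0):
    """Return 'increase', 'decrease' or 'no change' for a hypothetical fold threshold."""
    fc = fold_change(treated, vehicle)
    if fc >= threshold:
        return "increase"
    if fc <= 1.0 / threshold:
        return "decrease"
    return "no change"


# Hypothetical reporter enzyme readings (arbitrary units), three replicate wells each.
vehicle = [1000.0, 1100.0, 900.0]
compound_a = [2500.0, 2400.0, 2600.0]
compound_b = [950.0, 1050.0, 1000.0]
compound_c = [400.0, 500.0, 300.0]

for name, wells in [("A", compound_a), ("B", compound_b), ("C", compound_c)]:
    print(name, round(fold_change(wells, vehicle), 2), call_modulator(wells, vehicle))
```

In practice the threshold and the number of replicates would be chosen by the skilled person for the particular reporter and cell system employed.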
As used herein, the term "identifying" can include first identifying said
test compound as an agent that modulates gene expression or a polypeptide
activity (i.e., where the agent was not previously known or suspected of
having this activity) or can include confirming such activity (where the agent
was already suspected of having this activity but was not clearly shown to
have such activity) by performing one or more of the assays of the invention.
For example, where an agent has previously been implicated in modulation of
HFE2A gene or polypeptide activity, or where such agent was suspected, or
otherwise believed to be, an agent capable of altering, modulating, or
affecting iron metabolism, such as where the agent has been implicated in
regulating iron metabolism, and the activity of such agent is confirmed to have
such activity utilizing a method of the invention, such agent is deemed to have
been identified as having the indicated activity.

In another aspect, the present invention relates to a method for
identifying an agent that modulates HFE2A polypeptide activity, comprising:

(a) contacting a chemical agent with said HFE2A polypeptide under
conditions promoting protein:protein interactions, and/or

(b) contacting a chemical agent with said HFE2A polypeptide under
conditions promoting HFE2A ligand:HFE2A receptor interactions, and/or

(c) contacting a chemical agent with said HFE2A polypeptide under
conditions promoting modifications in lipid composition, and/or

(d) contacting a chemical agent with said HFE2A polypeptide under
conditions promoting alterations in associated second messenger signaling,

wherein such interaction, alteration or modification indicates
modulation, thereby identifying the chemical agent as an agent that
modulates HFE2A polypeptide activity.

HFE2A activity may be mediated through alteration of an associated
second messenger.

In one highly useful embodiment, the genetic construct is in a cell,
preferably a mammalian cell, most preferably a recombinant cell, such as
where the mammalian cell is a macrophage, liver cell, hepatocyte, intestinal
cell, inflammatory cell, hematopoietic cell, pancreatic cell, skeletal muscle
cell or a nervous system cell.

Table 3 below sets forth other polymorphisms that have been identified
in the HFE2A gene of control individuals (i.e. those with no clinical
manifestation of Juvenile Hemochromatosis or other iron metabolism
disorder). These polymorphisms, and others that may be discovered by those
skilled in the art, may be useful for diagnostic or therapeutic or other
purposes.



TABLE 3

Mutation                         Type  Sequence
c.1-15C>G, na                    WT    gcctgggaaaCctggctggat (SEQ ID NO: 33)
                                 MT    gcctgggaaaGctggctggat (SEQ ID NO: 34)
c.98-6C>G, intronic 2-3          WT    tcccttctgtCtttagctcat (SEQ ID NO: 35)
                                 MT    tcccttctgtGtttagctcat (SEQ ID NO: 36)
c.205_206insAGG, p.G69_70insG    WT    gaggaggagg---ccggggtgga (SEQ ID NO: 37)
                                 MT    gaggaggaggAGGccggggtgga (SEQ ID NO: 38)
c.432C>G, p.A144A                WT    gcctccctgcCccggaccctt (SEQ ID NO: 39)
                                 MT    gcctccctgcGccggaccctt (SEQ ID NO: 40)
c.483C>A, p.P161P                WT    atggtcgtccCccggggttct (SEQ ID NO: 41)
                                 MT    atggtcgtccAccggggttct (SEQ ID NO: 42)
c.488G>C, p.G163A                WT    cgtcccccggGgttcttgcat (SEQ ID NO: 43)
                                 MT    cgtcccccggCgttcttgcat (SEQ ID NO: 44)
c.569C>A, p.A190D                WT    gtccaaggagCttggcctcta (SEQ ID NO: 45)
                                 MT    gtccaaggagAttggcctcta (SEQ ID NO: 46)
c.627G>T, p.A209A                WT    cccccatggcGttgggggcca (SEQ ID NO: 47)
                                 MT    cccccatggcTttgggggcca (SEQ ID NO: 48)
c.682C>A, p.Q228K                WT    taagaacatgCaggaatgcat (SEQ ID NO: 49)
                                 MT    taagaacatgAaggaatgcat (SEQ ID NO: 50)
c.929C>G, p.A310G                WT    gccttctcagCtgaacaggac (SEQ ID NO: 51)
                                 MT    gccttctcagGtgaacaggac (SEQ ID NO: 52)
c.1207G>A, p.V403I               WT    ggatgctgggGttcctctttc (SEQ ID NO: 53)
                                 MT    ggatgctgggAttcctctttc (SEQ ID NO: 54)

Capital letter indicates site and nature of polymorphism. WT = wild-type; MT
= mutation.
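
The capitalization convention of Table 3 lends itself to a simple consistency check. The Python sketch below, offered as an illustration under stated assumptions rather than as part of the disclosed method, locates the capitalized site in a WT/MT pair and maps a coding position to its codon; the function names are hypothetical and only single-base substitutions are handled.

```python
def variant_site(wt, mt):
    """Return (index, wt_base, mt_base) for the capitalized site in a WT/MT pair.

    Assumes a substitution: both strings have equal length and exactly one
    capitalized position. The index is 0-based within the listed context.
    """
    if len(wt) != len(mt):
        raise ValueError("insertions/deletions are not handled in this sketch")
    sites = [i for i, (a, b) in enumerate(zip(wt, mt)) if a.isupper() or b.isupper()]
    if len(sites) != 1:
        raise ValueError("expected exactly one capitalized site")
    i = sites[0]
    # The flanking sequence must be identical apart from the variant base.
    if wt[:i] != mt[:i] or wt[i + 1:] != mt[i + 1:]:
        raise ValueError("flanking sequence differs between WT and MT")
    return i, wt[i], mt[i]


def codon_position(cdna_pos):
    """Map a 1-based coding position to (codon number, position within codon)."""
    return (cdna_pos - 1) // 3 + 1, (cdna_pos - 1) % 3 + 1


# SEQ ID NO: 47 and 48 (c.627G>T, p.A209A) as listed in Table 3.
print(variant_site("cccccatggcGttgggggcca", "cccccatggcTttgggggcca"))
print(codon_position(627))
```

For c.627G>T the site falls at the third position of codon 209, which is consistent with the synonymous change p.A209A listed in the table.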

Similarity to other Human genes

Figure 3 illustrates an alignment between HFE2A and 2 related
proteins found in the human genome. The similarity suggests that some
functionality may be shared or similar between these proteins.

Table 4 shows a comparison of Human LOC148738 426 aa protein
sequence (Protein3) to Human Paralogs* using BLAST**

Table 4.

Paralog                                Score           Alignment (a.a.)  Identity %  Positive %  Gaps %
Human RGM (SEQ ID No. 23)              370 bits (939)  432               48%         60%         12%
ENSESTP00000023393 (SEQ ID No. 24)     333 bits (844)  444               42%         55%         13%

* ENSESTP00000023393 (Ensembl) on Chromosome 5 and Human
Repulsive Guidance Molecule (Human RGM) (NCBI: NP_064596.1) on
chromosome 15

**Sequence comparison using "BLAST 2 Sequences" (Tatiana A.
Tatusova, Thomas L. Madden (1999), FEMS Microbiol Lett. 174:247-250)
Parameters: blastp, BLOSUM62 matrix, open gap penalty of 11, extension
gap penalty of 1, gap x_dropoff of 50, word size of 3, expect 10

Conservation of HFE2A in other organisms. 

2. Homologs of the human HFE2A gene were found in mouse (Mus
musculus), rat (Rattus norvegicus) and fish (Takifugu rubripes ("Fugu")).
Figure 4 sets out a sequence comparison at the nucleotide level of these
sequences. The high degree of conservation of these sequences is shown in
Table 5, which shows a comparison of Human LOC148738 426 aa protein
sequence (Protein3) to Orthologs* using BLAST**

Table 5.

Species                  Score             Alignment (a.a.)  Identity %  Positive %  Gaps %
Mouse (SEQ ID No. 25)    758 bits (1936)   420               88%         90%         0%
Rat (SEQ ID No. 26)      752 bits (1920)   426               86%         89%         0%
Fugu (SEQ ID No. 27)     336 bits (853)    383               46%         61%         4%

* Organism sequences: Mouse (Ensembl:
ENSMUSESTP00000016634), Rat (Translated from NCBI: AK098165.1) and
Fugu (Ensembl: SINFRUP00000138308).

**Sequence comparison using "BLAST 2 Sequences" (Tatiana A.
Tatusova, Thomas L. Madden (1999), FEMS Microbiol Lett. 174:247-250)
Parameters: blastp, BLOSUM62 matrix, open gap penalty of 11, extension
gap penalty of 1, gap x_dropoff of 50, word size of 3, expect 10

Comparison of human HFE2A with chicken RGM is shown in Figure 5.
The 426 aa sequence translation of Human LOC148738 (Protein3 identical to
ENSP00000304614) shares significant sequence homology with Chicken
Repulsive Guidance Molecule (RGM) (Translation of NCBI: AY128507.1).
Chicken RGM amino acid sequence is listed at SEQ ID No. 28. Using
"BLAST 2 Sequences" (Tatiana A. Tatusova, Thomas L. Madden (1999),
FEMS Microbiol Lett. 174:247-250) 395 of the residues align with 46%
Identity, 61% Positives and 9% Gaps. The two proteins also share an
N-terminal signal peptide, an RGD tri-amino acid motif that could be involved in
cell attachment, a site identified as a possible cleavage site in Chicken RGM,
a partial (~57%) von Willebrand Type D domain, and a predicted
N-Glycosylation site. Both proteins are also predicted to have GPI modifications
at the C-terminus.

A visual overview of similarities shared between human HFE2A and
homologs in mouse, rat and fugu, as well as sequences from Macaca RGM,
Chicken RGM, Xenopus RGM and C. elegans NP, is shown in Figure 6. This
degree of similarity illustrates that similar genes or proteins from other species
may be useful for various embodiments of the invention. The protein also has
the sequence Arg-Asn-Arg-Arg 335 representing the consensus cleavage site
(Arg-X-Lys/Arg-Arg) for the proprotein convertase furin (Molloy, S.S. et al. J.
Biol. Chem., 267, 16396-16402, (1992)) which would result in cleavage
between positions 335 and 336.
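
A search for the consensus site named above can be expressed as a regular expression over one-letter amino acid codes. The following Python sketch is illustrative only: the function name is hypothetical, and the peptide used is an invented example rather than the HFE2A sequence, which is not reproduced here.

```python
import re

# Consensus Arg-X-Lys/Arg-Arg in one-letter code. A lookahead is used so
# that overlapping occurrences are also reported.
FURIN_CONSENSUS = re.compile(r"(?=R.[KR]R)")


def furin_sites(protein):
    """Return the 1-based position of the final Arg of each consensus match.

    Cleavage would be expected immediately after each reported residue.
    """
    return [m.start() + 4 for m in FURIN_CONSENSUS.finditer(protein)]


# Hypothetical peptide for illustration only (not the HFE2A sequence).
peptide = "MGASRNRRGALQ"
print(furin_sites(peptide))
```

Applied to a full-length sequence, a site reported at residue 335 would correspond to cleavage between positions 335 and 336 as stated above.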

A comparison of sequences between related genes from human,
mouse, rat, chicken and zebrafish is shown at Figure 6B. Orthologs of the
human HFE2A protein are found in mouse, rat and zebrafish (Fig. 6B).
Sequence comparison shows that human HFE2A is >85% identical with the
mammalian orthologs, and ~45% identical to the fish ortholog. The 426 amino
acid translation of HFE2A also shares significant sequence similarity with the
Repulsive Guidance Molecule (RGM or RGMA) of human (48% identity) and
chicken (46% identity) (Fig. 6B). In humans there is also a third RGM-like
gene, RGMB, whose biological function is currently unknown.

HFE2A Polypeptide Function 

This invention teaches the important discovery that HFE2A is a critical
regulator of iron metabolism. Exactly how it is involved in this process remains
to be fully clarified. Important information about the activity of this protein can
be determined based on the analysis of protein domains.

The 426 aa LOC148738 gene product (Protein3) has structural
features consistent with a GPI (Glycosylphosphatidylinositol) anchored protein
(Fig. 7a) including an N-terminal hydrophobic signal peptide for access to the
Endoplasmic Reticulum, where GPI anchor addition occurs (residues 1-35,
Fig. 7b) and a C-terminal hydrophobic domain/GPI addition signal sequence
(residues 403-425, Fig. 7c) (Udenfriend S and Kodukula K Ann Rev Biochem
1995). The GPI-predicting program big-PI predictor predicts that this gene
product is GPI modified, and that either amino acid 399 or 400 can act as the
omega site (Eisenhaber et al., JMB, 292 (3), 741-758, 1999). Other features
identified are an RGD cell attachment tri-amino acid motif (residues 98-100)
(Ruoslahti, E. Annu. Rev. Cell Dev. Biol. 12, 697-715 (1996)) and a partial von
Willebrand factor type D domain (residues 167-253). von Willebrand factor
type D domains are required for multimerization of von Willebrand factor
(Jorieux, S. et al., Blood 1998 Dec 15;92(12):4663-70). In addition, the protein
has a possible proteolytic cleavage site identified by sequence comparison
with Chicken Repulsive Guidance Molecule (RGM) between residues 172 and
173 (Fig. 7e) (Monnier, P.P. et al. Nature 419, 392-394, 2002). Also present
but not shown are N-glycosylation and O-glycosylation sites which appear to
be consistent with consensus sequences.

Figure 8 shows a protein domain analysis of HFE2A from a
nucleotide/amino acid perspective. Features of the sequence are illustrated
therein.

Further aspects of HFE2A Polypeptide function at the biochemical level
may be determined by those skilled in the art using standard techniques. It is
clear from the instant invention that HFE2A Polypeptide is directly involved in
iron transport, as the absence of this gene in humans leads to the condition of
juvenile hemochromatosis (type 2A) or HFE2A.

Sixteen human tissues were examined for HFE2A expression by
probing Northern blots with a probe from exon 4. Significant HFE2A
expression was detected in liver, heart (especially atria) and skeletal muscle
(data not shown).

Based on publicly available ESTs (expressed sequence tags),
fragments of this gene have also been found to be expressed in the following
tissues:

Human HFE2A: skeletal muscle, liver, sciatic nerve, Liver and Spleen,
prostate, heart;

Mouse HFE2A: mammary gland, heart, tongue, liver, xiphoid cartilage,
medulla oblongata;

Rat HFE2A: brain, ovary, placenta, kidney, lung, liver, heart, muscle,
spleen.

Therefore the HFE2A gene appears to be widely expressed in many
tissues, but at reasonably low levels.

While not wishing to be bound to any particular mechanism of action of
how HFE2A mutation causes juvenile hemochromatosis, or leads to adult
onset hemochromatosis, it is possible to speculate on different roles the
gene/protein of the invention may play in iron metabolism. For instance, the
HFE2A protein may lie upstream or downstream of hepcidin in the iron
metabolism pathway. One possibility is that the protein is the hepcidin
receptor or a co-factor for the hepcidin receptor. Another possibility is that
HFE2A functions in the transcription, translation and/or processing of
hepcidin, and is necessary for proper functioning of hepcidin. HFE2A may be
a hepcidin interacting protein. An alternate function is that the HFE2A protein
may be a secreted factor from the bone marrow or liver, acting in a paracrine
fashion, as a hormone or hormone-like compound regulator of hepcidin levels.
These possible mechanisms of action are suggestive for further
exploration of the protein, but are not intended to be limiting on the scope of
claims included below. In all of these scenarios, the functional consequences
are the same, i.e. loss of function of HFE2A results in loss of the body's capacity
to control iron metabolism through normal hepcidin function, thus precipitating
the development of juvenile hemochromatosis, or adult onset hemochromatosis
(possibly where other factors in the hepcidin/HFE2A metabolic pathway are not
fully functional). Thus, modulating the activity of HFE2A is a method to regulate
serum levels, biodistribution, and/or metabolism of iron in the body.

Modulation of HFE2A is also a method for treating diseases which are
not directly related to iron or iron metabolism. A function for HFE2A unrelated to
its proposed role in iron metabolism is that of regulating blood volume. HFE2A
is highly expressed in heart, especially in the right atrium, and therefore it is
possible that HFE2A plays a hematological role, such as in sensing circulating
blood volume and the regulation of blood volume. Alternatively, HFE2A may
play a paracrine function in the heart protecting local myocytes from iron related
disease. Compounds which modulate the activity of HFE2A are therefore useful
in treating or preventing hematological disorders, cardiac disorders (including
but not limited to cardiac failure, cardiomyopathy, disorders of fluid imbalance,
and arrhythmias), and hormonal disorders. Also important, because hepcidin
levels are stimulated in bacterial infections, inflammation, response to LPS
injection, etc., HFE2A is a suitable therapeutic target for mediating aspects of
these disorders, and could be important in treatment of bacterial sepsis, etc.

This invention establishes that HFE2A is an essential
regulator of iron in the human body, regardless of its exact mechanism of action.
It is therefore a desirable drug target for treating diseases of iron metabolism in
animals. For example, because HFE2A is directly implicated in JH and adult
onset hemochromatosis according to this invention, the inventors recognize that
JH and other diseases where iron metabolism is defective may be treatable by
administering a chemical modulator of HFE2A. The disease may not necessarily
be related to aberrant HFE2A gene or protein activity. For example, a compound
which modulates HFE2A activity may compensate for insufficiencies in other
aspects of the iron metabolism pathway. Or, a modulator of HFE2A may be
used to treat a disease which is indirectly responding to iron levels in the blood.

As used in this disclosure the phrase "disease of iron metabolism" is
therefore to be construed in its broadest context. This includes diseases where
aberrant iron metabolism directly causes the disease, or where iron blood levels
are dysregulated causing disease, or where iron dysregulation is a consequence
of another disease, or where diseases can be treated by modulating iron levels,
etc. More specifically, a disease of iron metabolism according to this disclosure
includes iron overload disorders, iron deficiency disorders, disorders of iron
biodistribution, other disorders of iron metabolism and other disorders potentially
related to iron metabolism, etc. Even more specifically, diseases of iron
metabolism include hemochromatosis, ferroportin mutation hemochromatosis,
transferrin receptor 2 mutation hemochromatosis, juvenile hemochromatosis,
neonatal hemochromatosis, hepcidin deficiency, transfusional iron overload,
thalassemia, thalassemia intermedia, alpha thalassemia, sideroblastic anemia,
porphyria, porphyria cutanea tarda, African iron overload, hyperferritinemia,
ceruloplasmin deficiency, atransferrinemia, congenital dyserythropoietic anemia,
anemia of chronic disease, anemia, hypochromic microcytic anemia,
iron-deficiency anemia, conditions with hepcidin excess, Friedreich's Ataxia,
GRACILE syndrome, Hallervorden-Spatz disease, Wilson's disease, pulmonary
hemosiderosis, hepatocellular carcinoma, cancer, hepatitis, cirrhosis of liver,
pica, chronic renal failure, insulin resistance, diabetes, atherosclerosis,
neurodegenerative disorders, multiple sclerosis, Parkinson's Disease,
Huntington's Disease, Alzheimer's Disease.

In particular, a modulator of HFE2A can usefully treat anemia of chronic
disease. Worldwide, anemia of chronic disease (ACD, also known as
Inflammatory Anemia) is the second most common form of anemia. ACD
represents the most common form of anemia in hospitalized patients. ACD is an
acquired disorder seen in patients with a variety of conditions including infection,
malignancy and chronic inflammation. It is characterized by retention of iron by
macrophages and decreased intestinal iron absorption, which leads to reduced
iron availability for erythropoiesis. Early in the course of their disease, patients
with ACD have a mild, normocytic anemia due to impaired iron recycling. Later,
due to the reduced intestinal absorption, they become frankly iron deficient with a
hypochromic microcytic picture. Initial studies examining the role of hepcidin in
ACD show that excessive hepcidin production underlies ACD.

Loss-of-function of murine hepcidin leads to severe iron overload,
mimicking the biochemical and clinical phenotype of JH. In contrast, in both
animal models and human disease, hepcidin overexpression leads to
macrophage iron retention and an iron deficient phenotype typical of the iron
disturbances found in ACD. Existing therapy for ACD is mainly targeted to
treating the underlying disorder, with no efficacious treatment specifically
directed to amelioration of the iron phenotype. Therefore, therapeutic strategies
to reduce hepcidin production, such as by inhibition of HFE2A, represent an
important novel approach to treating the iron disturbances seen in ACD.

In some cases the diseases and disorders included in the definition of
"disease of iron metabolism" are not typically identified as being iron related. It is
recognized by the instant invention, based on the tissue distribution of
HFE2A (hemojuvelin) and its related protein, hepcidin, that iron metabolism may
play a significant role in these disease processes. For example, it has recently
been shown that hepcidin is very highly expressed in the murine pancreas,
suggesting that diabetes (Type I or Type II), insulin resistance, glucose
intolerance and other disorders may be ameliorated by treating underlying iron
metabolism disorders (see Ilyin, G. et al. 2003 FEBS Letters 542, 22-26). As
such, these diseases are encompassed under the broad definition.

Method of Treatment Using HFE2A as a Therapeutic Target 

The discovery that mutations in HFE2A relate to clearly definable
physiological outcomes in humans (namely juvenile hemochromatosis), now
allows the inventors to establish, for the first time, that hemojuvelin, the HFE2A
Gene and HFE2A Polypeptide are useful as therapeutic targets in humans for
the treatment of diseases of iron metabolism. The words "therapeutic target" are
intended to mean that therapeutic intervention is achieved with therapeutic
agents that modulate the activity of the gene or protein. "Modulate" means to
increase, to decrease, or to otherwise change the activity. Standard industrial
processes are available to those skilled in the art to confirm the identity of the
therapeutic agents which modulate the activity of the gene or protein, some of
which are set out below.

In accordance with the foregoing, the present invention relates to a
method for treating a disorder comprising administering to a person in need of
such treatment an effective amount of a selective HFE2A agonist or antagonist
or a pharmaceutically acceptable salt thereof, or a pharmaceutical composition
containing either entity.

In a preferred embodiment, said disorder is a disease of iron metabolism.
In a further preferred embodiment, said administering is by oral or intravenous
means.

Use of the HFE2A in Screening Assays for Identifying Therapeutic Agents and
Classes of Potential Therapeutic Agents.

The present invention readily affords different means for identification
of agents for treating diseases of iron metabolism according to their ability to
modulate the activity of HFE2A. Exemplary assay methods useful for the
identification of such compounds are detailed herein, although those skilled in
the art will be aware of alternative means.

In one series of embodiments described in some detail below, assay
methods involve testing libraries of chemical compounds, either one at a time
or in combinations, in an assay format designed to measure a biological
activity related to HFE2A. Those library compounds that modulate the
biological activity in the desired fashion are thereby identified as therapeutic
agents of the invention. In effect, a wide variety of compounds are
sequentially tested against the assay to determine whether they influence a
measurable biological activity of the assay. Assays may be based on one or
more of the diverse measurable biological activities of a gene or polypeptide
corresponding to HFE2A.

"HFE2A activity" or "HFE2A biological activity" as used herein,
especially relating to screening assays, is to be interpreted broadly and
contemplates all directly or indirectly measurable and identifiable biological
activities of the HFE2A gene, gene products and HFE2A Polypeptide.
Relating to the purified HFE2A Polypeptide protein, HFE2A Polypeptide
activity includes, but is not limited to, all those biological processes,
interactions, binding behavior, binding-activity relationships, pKa, pD, enzyme
kinetics, stability, and functional assessments of the protein. Relating to
HFE2A Polypeptide activity in cell fractions, reconstituted cell fractions or
whole cells, these activities include, but are not limited to, the rate at which
HFE2A Polypeptide performs any measurable biological characteristic and all
measurable consequences of these effects, including cell growth,
development or behavior and other direct or indirect effects of HFE2A
Polypeptide activity. Relating to HFE2A genes and transcription, HFE2A
activity includes the rate, scale or scope of transcription of genomic DNA to
generate RNA; the effect of regulatory proteins on such transcription; the
effect of modulators of such regulatory proteins on such transcription; plus the
stability and behavior of mRNA transcripts, post-transcription processing,
mRNA amounts and turnover, all measurements of expression and translation
of the mRNA into polypeptide sequences, and all measurements of protein
expression levels or dynamics. Relating to HFE2A activity in organisms, this
includes but is not limited to biological activities which are identified by their
absence or deficiency in disease processes or disorders caused by aberrant
HFE2A biological activity in those organisms. Some of the known or
suggested biological activities are set out in the description of HFE2A
provided in the specification; however, those skilled in the art will be able to
identify further measurable activities of HFE2A with routine techniques.
Broadly speaking, HFE2A biological activity can be determined by all these
and other means for analyzing biological properties of proteins and genes that
are known in the art.

It is also recognized that those skilled in the art may prefer to use forms
of HFE2A corresponding to the sequences disclosed herein, although not
necessarily the same. For example, while screening assays preferably employ
HFE2A from human, mouse, rat or fugu, other assays may utilize HFE2A from
a different organism, preferably a vertebrate, and most preferably from a
mammalian species. Thus the invention encompasses the use of, including
but not limited to, sheep, dog, cow or horse HFE2A, for the same purposes as
set out more specifically herein for human HFE2A. The shared technical
features of these forms of HFE2A are that, when expressed, they have
similar biological activity, and that they share functional similarity with HFE2A,
as the case may be, such as may be determined by those skilled in the art.
The HFE2A gene and/or HFE2A Polypeptide according to the invention may
also be obtained from other mammalian species, other vertebrates,
invertebrates and microorganisms based on the disclosure herein.

Thus, the polynucleotides for use in the screening assays of the
invention that "correspond to" the polynucleotide encoding HFE2A (processed
or unprocessed, including naturally occurring splice variants and alleles) are
at least 40%, preferably at least 50%, more preferably at least 60%,
especially at least 70%, even more preferably at least 80%, or even at least
85%, most preferably at least 90%, or even at least 95%, or most especially
at least 98%, with the especially preferred embodiment of identical to, and
especially having the sequence of, an RNA that would be encoded by, or be
complementary to, such as by hybridization under reasonably stringent
conditions, with an HFE2A polynucleotide (SEQ ID NO: 1-9, or 13-22). In
addition, HFE2A protein sequences encoding the same polypeptides as any
of the nucleic acid sequences corresponding to HFE2A, regardless of the
percent identity of such sequences, are also specifically contemplated by any
of the methods of the present invention that rely on any or all of said
sequences, regardless of how they are otherwise described or limited. Thus,
any such sequences are available for use in carrying out any of the methods
disclosed according to the invention. Such sequences also include any open
reading frames, as defined herein, present within an HFE2A polynucleotide.

Because of the processing that may take place in transforming the
initial RNA transcript into the final mRNA, the sequences disclosed herein
may represent less than the full genomic sequence. They may also represent
sequences derived from alternate splicing of exons. Consequently, the genes
present in the cell (and representing the genomic sequences) and the
sequences disclosed herein, which are mostly cDNA sequences, may be
identical or may be such that the cDNAs contain less than the full complement
of exons found in the genomic sequence. Such genes and cDNA sequences
are still considered corresponding sequences because they both encode
similar RNA sequences. Thus, by way of non-limiting example only, a gene
that encodes an RNA transcript, which is then processed into a shorter
mRNA, is deemed to encode both such RNAs and therefore encodes an RNA
corresponding to an HFE2A sequence as disclosed herein. (Those skilled in
the art understand that the word "encode" and its derivatives mean, in this
field, "can be transcribed into" or "can be translated into"). Thus, the
sequences disclosed herein correspond to genes contained in the cells and
are used to determine relative levels of transcription because they represent
the same sequences or are complementary to RNAs encoded by these
genes. Such genes also include different alleles and splice variants that may
occur in the cells used in the processes of the invention.

As used herein, the term "percent identity" or "percent identical," when
referring to a sequence, means that a sequence is compared to a claimed or
described sequence after alignment of the sequence to be compared (the
"Compared Sequence") with the described or claimed sequence (the
"Reference Sequence"). The Percent Identity is then determined according to
the following formula:

Percent Identity = 100 [1 - (C/R)]

wherein C is the number of differences between the Reference Sequence and
the Compared Sequence over the length of alignment between the Reference
Sequence and the Compared Sequence, wherein (i) each base or amino acid in
the Reference Sequence that does not have a corresponding aligned base or
amino acid in the Compared Sequence and (ii) each gap in the Reference
Sequence and (iii) each aligned base or amino acid in the Reference Sequence
that is different from an aligned base or amino acid in the Compared Sequence,
constitutes a difference; and R is the number of bases or amino acids in the
Reference Sequence over the length of the alignment with the Compared
Sequence, with any gap created in the Reference Sequence also being counted
as a base or amino acid.
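By way of non-limiting illustration only, the Percent Identity formula above can be sketched in Python for two sequences that have already been aligned. The function name, the use of "-" as the gap character, and the requirement that both aligned strings have equal length are assumptions made for this sketch and are not part of the disclosure; the sketch is case-sensitive and does not itself perform the alignment.

```python
def percent_identity(reference: str, compared: str, gap: str = "-") -> float:
    """Return 100 * (1 - C/R) for two pre-aligned strings of equal length.

    `reference` is the aligned Reference Sequence and `compared` is the
    aligned Compared Sequence; `gap` marks an alignment gap (an assumption).
    """
    if len(reference) != len(compared):
        raise ValueError("aligned sequences must have the same length")
    if not reference:
        raise ValueError("alignment is empty")

    differences = 0  # C in the formula
    for ref_char, cmp_char in zip(reference, compared):
        if ref_char == gap and cmp_char == gap:
            raise ValueError("alignment column contains gaps in both sequences")
        # A column counts as a difference when:
        # (i) a reference residue has no aligned partner (gap in compared),
        # (ii) there is a gap in the reference, or
        # (iii) the two aligned residues differ.
        if ref_char == gap or cmp_char == gap or ref_char != cmp_char:
            differences += 1

    # R: reference residues over the alignment, with each gap created in the
    # reference also counted, which equals the number of alignment columns.
    r = len(reference)
    return 100.0 * (1.0 - differences / r)
```

For example, aligning a hypothetical reference "ACGT-A" against "ACCTTA" gives one mismatch and one gap in the reference over six columns, so C = 2 and R = 6.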

If an alignment exists between the Compared Sequence and the
Reference Sequence for which the percent identity as calculated above is about
equal to or greater than a specified minimum Percent Identity, then the
Compared Sequence has the specified minimum percent identity to the
Reference Sequence even though alignments may exist in which the
hereinabove calculated Percent Identity is less than the specified Percent
Identity.

Where context indicates that sequences have been analyzed
using the NCBI BLAST tools, reports of sequence score, alignment, identity%,
positive% and gaps% have been generated based on these calculations: for
the segment pair, the alignment score is the sum of the scoring matrix values.
The theory of amino acid substitution matrices is described in [1], and applied to
DNA sequence comparison in [2]. The number of positives is the number of
residues for which the alignment score has a positive value. Experimentation
has shown that the BLOSUM-62 matrix [3] is among the best for detecting most
weak protein similarities. [1] Altschul, S.F. (1991) "Amino acid substitution
matrices from an information theoretic perspective." J. Mol. Biol. 219:555-565.
[2] States, D.J., Gish, W. & Altschul, S.F. (1991) "Improved sensitivity of nucleic
acid database searches using application-specific scoring matrices." Methods
3:66-70. [3] Henikoff, S. & Henikoff, J.G. (1992) "Amino acid substitution
matrices from protein blocks." Proc. Natl. Acad. Sci. USA 89:10915-10919.
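By way of non-limiting illustration only, the score, identity% and positive% of an ungapped segment pair as described above can be sketched as follows. The function name and the small scoring table are hypothetical; the table values are placeholders chosen for the example and are not offered as an authoritative copy of any published substitution matrix.

```python
def segment_pair_stats(seq_a: str, seq_b: str, matrix: dict) -> dict:
    """Score an ungapped segment pair with a symmetric scoring table.

    `matrix` maps residue pairs such as ("I", "L") to integer scores; a pair
    may be stored in either order.
    """
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("segments must be non-empty and of equal length")

    score = 0
    identities = 0
    positives = 0
    for a, b in zip(seq_a, seq_b):
        value = matrix[(a, b)] if (a, b) in matrix else matrix[(b, a)]
        score += value            # alignment score: sum of matrix values
        if a == b:
            identities += 1
        if value > 0:             # a "positive" is a column scoring above zero
            positives += 1

    n = len(seq_a)
    return {
        "score": score,
        "identity_pct": 100.0 * identities / n,
        "positive_pct": 100.0 * positives / n,
    }


# Hypothetical toy scoring table (placeholder values for illustration only).
TOY_MATRIX = {
    ("A", "A"): 4, ("L", "L"): 4, ("I", "I"): 4, ("K", "K"): 5,
    ("I", "L"): 2, ("K", "R"): 2, ("A", "K"): -1,
}
```

With this toy table, the segment pair "ALIK"/"ALLR" has two identical columns but four positively scoring columns, which shows why positive% can exceed identity%.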

As used herein, the terms "portion," "segment," and "fragment," when
used in relation to polynucleotides, refer to a continuous sequence of nucleotide
residues, which sequence forms a subset of a larger sequence. Such terms include
the products produced by treatment of said polynucleotides with any of the
common endonucleases, or any stretch of polynucleotides that could be
synthesized. These may include exonic and intronic sequences of
the corresponding genes.

The instant invention provides numerous assays which measure an
activity of HFE2A and are useful for the testing of compounds to identify
which ones affect such activity. In terms of formatting the assays, the assays
may use whole cells, cell extracts or reconstituted cell extracts, or purified or
semi-purified proteins, or they may be larger scale tissue or whole animal
assays. Common assays use measurements based on fluorescence,
luminescence, radioactivity, or other measures of protein or gene transcript
levels, amounts, or stability. Polypeptide assays may also include those
wherein the HFE2A polypeptide is attached to a solid support, such as a resin
or other polymer, especially where this is part of a column procedure for
determining activity. Alternatively, batch procedures may also be used.

In one aspect, the present invention relates to a method for identifying
an agent that modulates the activity of an HFE2A polypeptide, comprising:

a) contacting a test compound with an HFE2A polypeptide under
conditions supporting an activity of said polypeptide; and

b) determining a change in said activity of said HFE2A polypeptide as a
result of said contacting,

wherein a change in the activity identifies said test compound as an
agent that modulates the activity of an HFE2A polypeptide.

In a preferred embodiment, said gene has the sequence of SEQ ID NO:
1-9.

In a preferred embodiment, the observed change in activity in step (b)
is a decrease or an increase in activity, most preferably wherein said change
in activity is the result of binding to said polypeptide by said chemical agent of
step (b). In a preferred embodiment, such agents are useful for treating a
disease of iron metabolism.

In other preferred embodiments, the polypeptide is part of an intact cell,
preferably a mammalian cell, and which may be a recombinant cell. For HFE2A,
cells of greatest interest include macrophages, hepatocytes, intestinal cells,
heart cells (especially atrial cells), pancreatic cells, skeletal muscle cells and
cells of the bone marrow, although cells from other tissues may be employed. In
one such embodiment said cell has been engineered to comprise said polypeptide,
including by genetic engineering, especially where the cell does not possess the
polypeptide absent said engineering. Thus, the present invention specifically
contemplates embodiments in which the cell is engineered by other than genetic
engineering, such as where the activity of a polypeptide is to be enhanced and
the cell has been engineered to contain, or have on its surface, said polypeptide
but wherein the polypeptide is present due to physical insertion of the
polypeptide into the membrane or cytoplasm of the cell and not through
expression of a gene contained in the cell. Methods well known in the art, such
as use of polyethylene glycol, viruses, and the like, are available to effect such
insertions and the details of such procedures need not be further described
herein.

In one preferred embodiment of such method, the polypeptide is a
polypeptide that reacts with an antibody that reacts with, or is specific for, a
polypeptide having an amino acid sequence at least 95% identical to, more
preferably at least 98% identical to, the sequence of SEQ ID NO: 10 - 12 and
where any difference in amino acid sequence is due only to conservative amino
acid substitutions. In an especially preferred embodiment, the polypeptide has
the amino acid sequence of SEQ ID NO: 10 - 12.

The HFE2A Polypeptide assays of the invention may employ
compound screening technology such as (but not limited to) the ability of
various dyes to change color, or fluorescent signaling/quenching in response
to changes in assay conditions resulting from the activity of the polypeptides.
Compound screening assays can also be based upon the ability of test
compounds to modulate the interaction of the target peptide (HFE2A
Polypeptide) and known or discovered interacting proteins or protein
receptors. Such interacting proteins can be identified by a variety of methods
known in the art, including, for example, radioimmunoprecipitation,
co-immunoprecipitation, use of epitope-tagged constructs in pull down
experiments, co-purification by extraction and chromatography techniques,
and yeast two-hybrid screening. Such interactions can be further assayed by
means including but not limited to direct detection methods such as surface
plasmon resonance, fluorescence with fluorophores or fluorochromes and
quenching with chromophores, radiolabeled proteins, fluorescence
polarization, confocal fluorescence imaging or scintillation proximity
techniques. Indirect interactions may also be monitored through cell viability
assays, second messenger reporting such as G-protein coupled receptor
assays, cAMP detection, nitric oxide synthase, phosphodiesterase activity, or
lipid modification assays such as sphingomyelinase or inositol triphosphate
assays.

Agents that have the effect of modulating the half-life of HFE2A
Polypeptide are also useful for treating diseases of iron metabolism. By way
of non-limiting example, an assay for this kind of agent comprises cells
expressing a wild-type HFE2A Polypeptide wherein such polypeptides are
transiently metabolically labeled during translation, the cells are contacted
with a candidate compound, and the half-life of the polypeptide is determined
using standard techniques. Compounds that modulate the half-life of the
polypeptide are useful compounds in the present invention.
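By way of non-limiting illustration only, one common way to estimate a half-life from such a labeling experiment is to assume first-order decay of the labeled polypeptide and fit a straight line to the logarithm of the remaining signal over time. The sketch below makes that assumption; the function name and the example time points and signal values are hypothetical and do not represent measured data.

```python
import math


def half_life(times, signals):
    """Estimate half-life assuming first-order decay of labeled polypeptide.

    Fits ln(signal) = ln(S0) - k * t by ordinary least squares and returns
    ln(2) / k, in the same time units as `times`.
    """
    if len(times) != len(signals) or len(times) < 2:
        raise ValueError("need at least two matched time points")
    if any(s <= 0 for s in signals):
        raise ValueError("signals must be positive to take logarithms")

    logs = [math.log(s) for s in signals]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")

    slope = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs)) / sxx
    if slope >= 0:
        raise ValueError("signal does not decay over time")
    return math.log(2) / -slope
```

Comparing the value returned for cells contacted with a candidate compound against the value for untreated cells would indicate whether the compound lengthens or shortens the half-life under these assumptions.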
In one such assay for which the polypeptides encoded by genes
disclosed herein are useful, the purified or semi-purified HFE2A polypeptide
(or a fragment thereof or an epitope-tagged form or fragment thereof) is
bound to a suitable support (e.g., nitrocellulose or an antibody or a metal
agarose column in the case of, for example, a his-tagged form of said
polypeptide). Binding to the support is preferably done under conditions that
allow proteins associated with the polypeptide to remain associated with it.
Such conditions may include use of buffers that minimize interference with
protein-protein interactions. If desired, other proteins (e.g., a cell lysate) are
added, and allowed time to associate with the polypeptide. The immobilized
polypeptide is then washed to remove proteins or other cell constituents that
may be non-specifically associated with the polypeptide or the support. The
immobilized polypeptide can then be used for multiple purposes. In a
compound screening embodiment, such as that provided by Neogenesis
Pharmaceuticals, Inc. (Cambridge, MA), the bound HFE2A Polypeptide will be
employed in an automated ligand identification system, with low, medium or
high-throughput capacity. In this case a pool of test compounds is exposed
to HFE2A under conditions (e.g., buffers, temperatures, etc.) which promote
specific binding of the test compounds to the protein. Compounds with
non-specific binding are separated from the mixture. HFE2A/ligand complexes
are then collected, and bound ligands are released and measured by mass
spectrometry. A data analysis system correlates mass data with the list of
compound masses included in the original test compound mixture. In an
alternative embodiment, compounds can be tested for their ability to interfere
with interactions between HFE2A Polypeptide and other bound molecules
(which are presumably HFE2A Polypeptide interacting proteins). Compounds
which can successfully displace interacting proteins are thereby identified as
HFE2A Polypeptide modulating agents of the invention. Other well known
protein binding assays, which use purified or semi-purified target protein, can
also be employed to identify test compounds with specific binding affinity for
the protein.
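By way of non-limiting illustration only, the step in which mass data are correlated with the list of compound masses in the original test mixture can be sketched as a tolerance-based lookup. The function name, the parts-per-million tolerance, and the compound names and masses are hypothetical values assumed for the sketch; they do not describe any particular commercial ligand identification system.

```python
def match_masses(observed, library, tolerance_ppm=10.0):
    """Map each observed ligand mass to library compounds within a tolerance.

    `observed` is a list of measured masses; `library` maps compound names to
    expected masses for the original test mixture. Both are assumptions of
    this sketch.
    """
    hits = {}
    for mass in observed:
        hits[mass] = [
            name
            for name, expected in library.items()
            if abs(mass - expected) / expected * 1e6 <= tolerance_ppm
        ]
    return hits


# Hypothetical test mixture (names and masses are placeholders).
EXAMPLE_LIBRARY = {
    "cmpd_A": 250.1200,
    "cmpd_B": 312.2000,
    "cmpd_C": 250.1300,
}
```

An observed mass that matches no entry, or that matches more than one entry, would need follow-up by other means; the sketch simply reports the candidate list for each measurement.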
In an alternative embodiment designed to identify HFE2A Polypeptide
interacting proteins, the immobilized polypeptide is dissociated from its
support, and proteins bound to it are released (for example, by heating), or,
alternatively, associated proteins are released from the polypeptide without
releasing the latter polypeptide from the support. The released proteins and
other cell constituents can be analyzed, for example, by SDS-PAGE gel
electrophoresis, Western blotting and detection with specific antibodies,
phospho-amino acid analysis, protease digestion, protein sequencing, or
isoelectric focusing. Normal and mutant forms of such polypeptide can be
employed in these assays to gain additional information about which part of
the polypeptide a given factor is binding to. In addition, when incompletely
purified polypeptide is employed, comparison of the normal and mutant forms
of the protein can be used to help distinguish true binding proteins. Such an
assay can be performed using a purified or semipurified protein or other
molecule that is known to interact with a polypeptide encoded by an HFE2A
polynucleotide.

This assay may include the following steps.

1. Harvest the encoded polypeptide and couple a suitable fluorescent
label to it;

2. Label an interacting protein (or other molecule) with a second,
different fluorescent label. Use dyes that will produce different quenching
patterns when they are in close proximity to each other versus when they are
physically separate (i.e., dyes that quench each other when they are close
together but fluoresce when they are not in close proximity);

3. Expose the interacting molecule to the immobilized polypeptide in
the presence or absence of a compound being tested for its ability to interfere
with an interaction between the two; and

4. Collect fluorescent readout data.

An alternative assay for such protein interaction is the Fluorescence
Resonance Energy Transfer (FRET) assay. This assay can be performed as
follows.

1. Provide the encoded protein or a suitable polypeptide fragment
thereof and couple a suitable FRET donor (e.g., nitro-benzoxadiazole (NBD))
to it;

2. Label an interacting protein (or other molecule) with a FRET
acceptor (e.g., rhodamine);

3. Expose the acceptor-labeled interacting molecule to the donor-labeled
polypeptide in the presence or absence of a compound being tested
for its ability to interfere with an interaction between the two; and

4. Measure fluorescence resonance energy transfer.

Quenching and FRET assays are related. Either one can be applied in 
a given case, depending on which pair of fluorophores is used in the assay. 
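By way of non-limiting illustration only, the readout of such a quenching or FRET assay can be reduced to an apparent transfer efficiency using the commonly used donor-quenching relation E = 1 - (F_DA / F_D), where F_DA is the donor fluorescence in the presence of the acceptor-labeled partner and F_D is the fluorescence of the donor alone. The function names and the numerical readings below are hypothetical and are assumed only for the purposes of this sketch.

```python
def fret_efficiency(donor_with_acceptor: float, donor_alone: float) -> float:
    """Apparent FRET efficiency from donor quenching: E = 1 - F_DA / F_D."""
    if donor_alone <= 0:
        raise ValueError("donor-alone fluorescence must be positive")
    return 1.0 - donor_with_acceptor / donor_alone


def percent_disruption(e_without_compound: float, e_with_compound: float) -> float:
    """Percent loss of transfer efficiency caused by a test compound."""
    if e_without_compound <= 0:
        raise ValueError("no energy transfer measured without compound")
    return 100.0 * (e_without_compound - e_with_compound) / e_without_compound
```

For instance, if the donor signal is 1000 units alone, 400 units with the acceptor-labeled partner, and 700 units when a test compound is also present, the efficiencies are 0.6 and 0.3, which corresponds to a 50% disruption of the interaction under these assumptions.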

The description of the HFE2A provided herein teaches a wide variety of
biological activities of HFE2A that may be useful for the development of low,
medium or high-throughput screening assays.

One useful biological activity that works for a variety of assays is ligand
binding (i.e., assays in which a test compound either inhibits or enhances
HFE2A binding with a ligand). The hepcidin protein (OMIM: 606464; see
Roetto, A. et al. 2003. Nature Genet 33: 21-22) may be a ligand of HFE2A.
Those skilled in the art may find other ligands. Assays based on whole cells,
cell extracts or purified proteins may be developed which measure the capacity
of a test compound to inhibit or enhance HFE2A binding with a specific ligand.

Cell function assays can be designed. In these assays, a measurable
cell function which is dependent on HFE2A activity can be measured to
determine inhibition or enhancement by test compounds. Such functions
include cell growth assays, such as where compounds are evaluated for their
ability to influence axonal growth, in assays such as those described in PCT
patent publication WO 02/051438, incorporated herein by reference in its
entirety. Based on the presence of RGD and von Willebrand factor-like
domains, HFE2A may be involved in cell-cell adhesion. An assay can be
designed which relies on the biological activity of HFE2A induced adhesion,
and measures a test compound's ability to modulate adhesion.

In other cell based assays, an HFE2A protein response is first
calibrated. For example, HFE2A may modulate release of iron by
macrophages to transferrin. Or, HFE2A may modulate transport of iron across
a Caco2 membrane. Alternatively, HFE2A may induce a change in NRAMP2
iron transport behavior in an assay. One may assay for compounds which
modulate these induced responses to HFE2A and thereby find a modulator of
a biological activity of HFE2A. Signaling assays can be developed based on
possible functions of HFE2A as a signaling molecule. Iron transport assays
that may be adapted include iron uptake in an HFE-CHO cell model for
duodenal crypt cells. This can be adapted to assay in 96-well plates wherein
test compounds will reduce iron uptake. Similarly, test compounds may inhibit
59Fe-bound transferrin uptake in human hepatoma cells (HLF). Assays that
may be suitable for adaptation include those described in Su, M.A. et al.
Blood. Vol. 92 No. 6 (September 15), 1998: pp. 2157-2163; Nunez MT, et al.
J Nutr 1996 Sep;126(9):2151-8; Tandy S, et al. J Biol Chem 2000 Jan
14;275(2):1023-9; or Feeney GP and Worwood M. Biochim Biophys Acta
2001 Apr 23;1538(2-3):242-51. HFE2A shares sequence similarity and
similar structural domains with the human repulsive guidance molecule
(RGM) (Fig. 3) and with chicken RGM-like molecule (Fig. 5). RGMs are a family
of proteins involved in neural development and immune response (see WO
02/051438), though assignment of at least some of these functions to HFE2A
protein must be considered doubtful, as protein and mRNA expression is not
found in neural cells. In any case, related proteins are known to act locally on
the cell membrane as receptors, initiating intracellular signals, and
systemically as a cleaved soluble protein. Like the chicken RGM, HFE2A has
a putative extracellular cleavage site, suggesting that it acts as a soluble
peptide. One proposed mechanism of action could be that this secreted
molecule interacts with proteins on the basolateral surface of the intestinal
epithelial cells, like hepcidin, and modulates the apical surface uptake of iron
from diet. This interaction could be direct, at the basal surface of the intestinal
cells, or via an upstream interaction resulting in modulating iron uptake. This
direct or indirect effect can be assayed in a cell culture system.

An example of a compound screening assay which relies on an HFE2A
induced response (i.e., an HFE2A biological activity) is a Caco2 HFE2A
induced iron transport assay. Caco2 cells are a human colorectal carcinoma
cell line and are considered the standard for measuring intestinal absorption.
These cells, grown to confluence, form distinct apical and basolateral surfaces
and tight junctions. Grown on permeable membranes in Transwell plate
inserts, Caco2 cells become polarized with the basolateral surface of the cell
on the membrane and exposed to the B chamber solutes, while the apical
side of the cell is exposed to solutes in the A chamber. This system is useful
to measure active transport of iron through the intestinal epithelium from A to
B, mimicking transport from the intestinal lumen through the cell to the blood.
A response to HFE2A can be induced by adding HFE2A to the B
(basolateral) chamber of the Caco2 assay. Recombinant HFE2A can be produced
in mammalian cells and the purified protein directly added to the B chamber.
Alternatively, cultured cells can be transfected with HFE2A constructs and
then express HFE2A protein. Appropriate cells for transfection would be
HepG2 or RAW cells, human liver carcinoma cells or mouse macrophage
cells, respectively. These two cell types are known to be involved in iron
metabolism, and likely express HFE2A. The media from these transfected
"donor" cells containing HFE2A can now be applied to the B chamber of the
Caco2 cells, the "acceptor" of the conditioned media, and iron absorption from
the A chamber can be monitored. The transfected donor cells could also be
grown on the bottom of the B chamber surface, thus directly releasing HFE2A
into the B chamber, within millimeters from the basal surface of the acceptor
cells.

Whether HFE2A acts as a receptor, intracellular signal molecule or
secreted protein, this multicellular communication system can be applied to
measure an HFE2A induced response. The assay may measure iron
transport, iron uptake from the A chamber, iron absorption and/or cell
secretion of iron to the B chamber. This cell communication system can
thereafter be applied to primary drug compound screening for iron modulation
in intestinal absorption.

Using whole cells or cell extracts, assays can be developed for
compounds which increase or decrease GPI cleavage or secretion of HFE2A
or for compounds which increase or decrease N-glycosylation of HFE2A.
Assays that measure compounds which modulate post-translational modifiers
of HFE2A can be used to identify potential therapeutic agents.

Some assays preferably employ purified or semi-purified HFE2A
protein. This protein may be the GPI-linked form, or the cleaved,
soluble form. 125I-labelled HFE2A may be useful for these assays. Such
assays include aggregation assays, which are designed based on the
tendency of HFE2A to aggregate or homodimerize via the RGD or von
Willebrand factor-like domains. In this assay, compounds are tested for their
ability to enhance or prevent aggregation of homophilic dimerization domains.
In this case, the protein-protein interaction assays above can use two pools of
HFE2A, each labelled with a different fluorophore. In some embodiments a
GPI-cleaved form of HFE2A may be used with the GPI-linked form. An
alternate semi-purified format includes yeast 2-hybrid and/or phage display
formats, wherein HFE2A binding regions are incorporated into both bait and
prey vectors. This format would provide a moderately high throughput
screening assay, suitable for radioimmunoassay or plate format.

In addition it is to be noted that HFE2A may interact with other known
proteins of the iron metabolic pathway: transferrin, Tf receptor, Hfe, hepcidin,
p97 or other iron transporters, and receptors thereof. This interaction is a
useful activity which may be used as the basis for a screening assay.

Additionally, drug screening assays can also be based upon
polypeptide functions deduced upon antisense interference with the gene
function. Intracellular localization of HFE2A, or effects which occur upon a
change in intracellular localization of such proteins, can also be used as an
assay for drug screening.

In accordance with the foregoing, the present invention provides the
amino acid sequence of a protein, designated hemojuvelin or HFE2A, that is
found in tissues of the human body (for example, SEQ ID NO: 10-12 from
humans) and which is associated with hereditary transmission of juvenile
hemochromatosis. In addition, several mutations have been found in this
sequence derived from individuals found to have juvenile hemochromatosis.
Thus, agents that interact with hemojuvelin or HFE2A represent candidate
compounds for evaluation as therapeutic agents of the invention.

Another broad category of screening assays is transcription assays.
These assays seek to identify compounds which modulate (i.e., increase or
decrease) transcription of the HFE2A gene. Relating to transcription assays,
in one aspect the present invention relates to a method for identifying an
agent that modulates the activity of a polynucleotide whose transcription
contributes to amelioration of a disease of iron metabolism, comprising:

a) contacting under physiological conditions a chemical agent with a
polynucleotide corresponding to the promoter region of HFE2A (in a preferred
embodiment, the promoter has the sequence of SEQ ID NO: 19); and

b) detecting a change in the transcription of said polynucleotide as a
result of said contacting;

thereby identifying an agent that modulates transcription of said
polynucleotide or gene activity.

Such modulation is preferably a decrease or an increase in transcription.
In preferred embodiments, such transcription is measured by measuring the
amount of a transcription product. In a convenient embodiment, the promoter
region of HFE2A is operably linked to a reporter gene, that is, a gene whose
transcription is conveniently measured (for example, reporter genes such as
Green Fluorescent Protein, luciferase, chloramphenicol acetyl-transferase
(CAT), and the like).

In preferred embodiments, the polynucleotide whose transcription is to
be measured or monitored is present in an intact cell, preferably a mammalian
cell, most preferably a cell selected from among macrophages, hepatocytes,
intestinal cells, hematopoietic cells, pancreatic cells, skeletal muscle cells and
other cells of the bone marrow, neurological, CNS or spinal cells, although
cells from other tissues may also be used. In additional preferred
embodiments, such an intact cell is a cell that has been engineered to
comprise said polynucleotide, such as by genetic engineering, preferably
wherein the cell does not express the subject gene or polynucleotide absent
having been engineered to do so. Alternatively, non-recombinant cells can be
employed, as for example, in cells of tissues or cell lines where the
transcription of HFE2A is constitutive, or can be induced.

In accordance with the disclosure herein, upstream untranslated
regions and promoter regions of HFE2A Gene are readily obtained from SEQ
ID NO: 19 or 22 and other publicly retrievable sequence databases. Such
genomic or untranslated regions may be included in plasmids comprising the
identified gene, such as in assays to identify compounds which modulate
transcription thereof. In one such assay, the upstream genomic region is
ligated to a reporter gene, and incorporated into a transcription plasmid. The
plasmid is transfected into a cell, and the recombinant cell is exposed to test
compound(s). Those compounds which increase or decrease the expression
of the reporter gene are then modulators of the gene/protein, and are
considered therapeutic agents of the invention.
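By way of non-limiting illustration only, the readout of such a reporter assay can be summarized as a fold change of reporter signal in compound-treated wells relative to vehicle-treated wells. The function names, the two-fold threshold, and the example readings are hypothetical choices made for this sketch; the disclosure does not specify a threshold.

```python
from statistics import mean


def fold_change(treated_wells, vehicle_wells):
    """Mean reporter signal with compound divided by mean signal with vehicle."""
    baseline = mean(vehicle_wells)
    if baseline <= 0:
        raise ValueError("vehicle signal must be positive")
    return mean(treated_wells) / baseline


def classify(fc, threshold=2.0):
    """Label a compound by its fold change; the threshold is an assumption."""
    if fc >= threshold:
        return "increases reporter expression"
    if fc <= 1.0 / threshold:
        return "decreases reporter expression"
    return "no call"
```

For example, hypothetical treated readings of 300, 310 and 290 against vehicle readings of 100 give a fold change of 3.0, which this sketch would label as increasing reporter expression; in practice replicate variability and cell viability would also need to be taken into account.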

The invention also claims recombinant cells engineered to express a
polynucleotide or polypeptide as disclosed herein. The gene disclosed herein
as being involved in HFE2A in an animal can be used, or a fragment thereof
can be used, as a tool to express a protein, where such genes encode a
protein, in an appropriate cell in vitro, or can be cloned into expression vectors
which can be used to produce large enough amounts of protein to use in in
vitro assays for drug screening. Alternatively, the expression construct may
employ the genomic promoter region of HFE2A and link it to a gene, such as
a reporter gene, whose expression level is easily measured. Expression
systems which may be employed include baculovirus, herpes virus,
adenovirus, adeno-associated virus, bacterial systems, and eucaryotic
systems such as CHO cells. Naked DNA and DNA-liposome complexes can
also be used. The invention thus claims recombinant cell lines containing a
heterologous HFE2A gene.

Such recombinant cells may be used in transcription assays for
analyzing the levels of transcription of HFE2A Gene or a suitable reporter
gene after contacting said cells with agents that may have therapeutic
properties. The levels of gene transcription can be quantified by Northern blot
analysis or RT-PCR, or, alternatively, by measuring the amount of protein
produced, by one of a number of methods known in the art, or by measuring
the levels of biological activity of polypeptides encoded thereby or other
genes. In this way, the gene transcription can serve as a marker, indicative of
the physiological response of the cells to the agent. Accordingly, this
response state may be determined before, and at various points during,
treatment of the individual with the agent.

Recombinant cell lines are also preferred for the preparation of purified
protein, if a purified protein assay is desired. Those skilled in the art are
capable of producing recombinant cell lines and extracting protein fractions
containing highly purified proteins. These samples can be used in a variety of
binding assays to identify compounds which interact with the proteins.
Compounds that interact are therapeutic agents of the invention, or analogs
thereof.

Target selectivity is an important aspect of the development of
therapeutic agents. The present invention specifically contemplates the
identification of chemical agents, especially small organic molecules, that
agonize or antagonize the transcription of HFE2A Gene, as defined herein, or
the activity of the HFE2A Polypeptide (such as SEQ ID NO: 10 - 12 from
humans) encoded thereby, with high specificity and that have little or no effect
on other genes and/or polypeptides.

Thus, in one such preferred embodiment, the methods disclosed herein
for identifying an agent that modulates, preferably inhibits, expression of a
gene corresponding to HFE2A, preferably having the sequence of SEQ ID
NO: 1-9 from humans, or the activity of a polypeptide encoded thereby,
comprise first identifying such agent and then testing such agent for effects
on expression or activity of at least one other gene or polypeptide, preferably
a gene or polypeptide with important physiological consequences that are
preferably not disturbed by therapeutic intervention, and demonstrating little or
no effect.

In another aspect, the invention provides a method for computationally
identifying a compound of the invention. The method involves (a) determining
the crystal structure of an active site of an HFE2A Polypeptide (e.g.,
through x-ray crystallography or other techniques); and (b) through
computational modeling, identifying a compound which interacts with the
active site, thereby identifying a compound, or its analog, as a compound
which is useful for modulating the activity of such a polypeptide. This process
is sometimes referred to as in silico screening. Sophisticated software for
testing the probability of test compounds to interact with the target protein,
which can test tens of millions of computer generated compounds, is available
to those skilled in the art.

Potential therapeutic compounds identified using the methods of the
invention are usually tested in animal model systems to confirm the putative
efficacy. Thus, in a further aspect, the present invention relates to a method
for identifying a therapeutic agent, comprising:

a) administering to an animal an agent found to have activity using an
assay or screening method as disclosed herein, and

b) detecting in said animal a change in iron metabolism due to said
administering,

thereby identifying an agent for the treatment of a disease of iron
metabolism.

Those skilled in the art are aware that typical measurements of iron
metabolism that may be modulated in animal models (i.e., as a result of
treatment with a potential therapeutic agent) include transferrin saturation,
hepcidin levels, radioactive iron uptake in the gut, liver iron content, whole
body iron content, and anemia indices. Specialized mouse models for study
include anemic mice, iron overload mice, hemochromatosis mice (hfe/hfe),
Hpx mice (hypotransferrinemic mouse), and others.

In a further aspect, the present invention relates to a method for treating
a condition in an animal afflicted with a disease of iron metabolism comprising
administering to said animal an effective amount of an agent first identified by an
assay method of the invention. Preferably, said animal is a human patient, such
as a patient afflicted with a disease of iron metabolism.

The screening assays of the invention thus simplify the evaluation, 
identification and development of therapeutic agents for the treatment of 
diseases of iron metabolism. 

The invention also includes antibodies and immuno-reactive
substances which target, interact with or bind to hemojuvelin, HFE2A
Polypeptide or epitopes thereof. Polypeptides encoded by the
polynucleotides disclosed herein can be used as an antigen to raise
antibodies, including monoclonal antibodies. Such antibodies will be useful for
a wide variety of purposes, including but not limited to, therapeutic agents,
functional studies, drug screening assays, and/or diagnostic agents.
Monitoring the influence of agents (e.g., small organic compounds) on the
expression or biological activity of the HFE2A polypeptides identified
according to the invention can be applied not only in basic drug screening, but
also in clinical trials. For example, the effectiveness of an agent determined
by a screening assay as described herein to increase or decrease gene
transcription, protein levels, or biological activity can be monitored in clinical
trials of subjects exhibiting symptoms of diseases of iron metabolism due to
inadequate gene transcription, protein levels, or biological activity (for
example, the individuals studied herein). Alternatively, the effectiveness of an
agent determined by a screening assay to modulate transcription of HFE2A
Gene, as well as structurally and functionally related genes, including genes
with high homology thereto, and including protein levels, or biological activity
can be monitored in clinical trials of subjects exhibiting altered
gene transcription, protein levels, or biological activity. In such clinical trials,
the transcription or activity of the genes or polypeptides disclosed herein and,
preferably, other genes that have been implicated in, for example, iron
metabolism, can be used to ascertain the effectiveness of a particular drug.

In a preferred embodiment, the present invention provides a method for
monitoring the effectiveness of treatment of a subject with an agent (e.g., an
agonist, antagonist, peptidomimetic, protein, peptide, antibody, nucleic acid,
small molecule, or other drug candidate identified by the screening assays
described herein) including the steps of (i) determining that a patient requires
treatment for an iron metabolism disease or disorder; (ii) administering an
effective amount of an agent identified using one of the screening assays
disclosed herein; (iii) ascertaining an improvement in iron metabolism
following said administration; and (iv) altering the administration of the agent to
the subject accordingly. For example, increased administration of the agent
may be desirable to decrease the expression or activity of the gene or encoded
polypeptide, i.e., to increase the effectiveness of the agent.

Where the patient is non-human, biopsy samples can be taken to show
a decrease in gene transcription, such as by measuring levels of protein,
mRNA, or genomic DNA in post-administration samples and comparing the level
of expression or activity of said protein, mRNA, or genomic DNA in the
pre-administration sample with that of the corresponding post-administration
sample or samples, thereby showing the effects of drug administration on one
or more of the genes disclosed herein and concomitant reduction in problems
with iron metabolism.
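
By way of illustration only, the pre- versus post-administration comparison described above could be expressed as a percent change in a measured level. The following is a minimal sketch and not part of the disclosed methods: the function names, the sample values and the 20% cut-off are hypothetical assumptions chosen for the example.

```python
from statistics import mean


def percent_change(pre_levels, post_levels):
    """Percent change of the mean post-administration level
    relative to the mean pre-administration level."""
    pre = mean(pre_levels)
    post = mean(post_levels)
    if pre == 0:
        raise ValueError("pre-administration mean must be non-zero")
    return 100.0 * (post - pre) / pre


def shows_decrease(pre_levels, post_levels, threshold_pct=20.0):
    """True if the level fell by at least threshold_pct percent.

    threshold_pct is an arbitrary illustrative cut-off, not a value
    taken from the specification.
    """
    return percent_change(pre_levels, post_levels) <= -threshold_pct


# Hypothetical mRNA levels in arbitrary units (illustrative data only)
pre_samples = [10.0, 12.0, 11.0]
post_samples = [6.0, 7.0, 6.5]
print(round(percent_change(pre_samples, post_samples), 1))
print(shows_decrease(pre_samples, post_samples))
```

In practice the measured quantity, the number of replicate samples and any decision threshold would be chosen by the skilled person for the particular assay.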

Purified or semi-purified HFE2A protein, or fragments thereof, or
proteins corresponding to HFE2A, and any biochemically modified versions
thereof, are themselves therapeutic agents of the invention. Recombinant or
non-recombinant forms of these proteins or fragments can be administered to
persons in need thereof for the treatment of disorders, such as diseases of
iron metabolism. Those skilled in the art are familiar with techniques for
generating such agents, and for determining conditions of administration.

Specific compounds which will modulate the gene expression or gene
transcript levels in a cell of HFE2A include antisense nucleic acids, ribozymes
and other nucleic acid compositions which specifically hybridize with the
HFE2A gene (including exons or introns of such genes, promoters, 3' tails,
and the like). These specific compounds are compounds of the invention, and
are useful for treating the diseases discussed previously. Design and
manufacturing of such compounds are well known to those skilled in the art.

Specific compounds which modulate the activity of HFE2A include
antibodies (polyclonal or monoclonal) and modified antibodies or fragments of
antibodies which specifically bind to an epitope of said polypeptide. These
specific compounds are compounds of the invention, and are useful for
treating the diseases previously discussed. Design and manufacturing of such
compounds are well known to those skilled in the art. In particular, humanized
antibodies tend to be preferred, such as those generated using techniques
provided by Abgenix, Inc. (Fremont, CA), Medarex, Inc. (Princeton, NJ),
Protein Design Labs, Inc. (Fremont, CA), Genentech (South San Francisco,
CA), and others.

Specific compounds which modulate the activity of HFE2A in the body
include gene therapy vectors comprising all or a part of the HFE2A gene
sequence or mutant HFE2A sequence. As is well known to those skilled in the
art, gene therapy allows the delivery of HFE2A in an organism to cells where
it is taken up and expressed, thus changing the level or amount of HFE2A
polypeptide in such cells. These vectors thereby modulate the activity
of HFE2A in the body and are useful for the therapeutic indications disclosed
herein.

Specific compounds which modulate the activity of HFE2A in the body
include small organic molecules. Such compounds may be naturally
occurring, or they may be synthetic. Collections and combinatorial libraries of
such compounds are widely available from commercial sources. As known to
those skilled in the art, a screening assay, such as the assays disclosed in the
instant specification, can be easily adapted to identify therapeutic agents
which have the desired HFE2A modulating ability. Agonists, antagonists, or
mimetics found to be effective at reducing disorders of iron metabolism may
be confirmed as useful in animal models (for example, mice, chimpanzees,
etc.). In other embodiments, treatment with a compound of the invention may
be combined with other therapeutic agents to achieve a combined, possibly
even synergistic, effect.

Candidate modulators may be purified (or substantially purified)
molecules or may be one component of a mixture of compounds (e.g., an
extract or supernatant obtained from cells). In a mixed compound assay, gene
expression is tested against progressively smaller subsets of the candidate
compound pool (e.g., produced by standard purification techniques, such as
HPLC or FPLC) until a single compound or minimal compound mixture is
demonstrated to modulate gene or protein activity or expression in a manner
having therapeutic effects.
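
The progressive narrowing of a compound pool described above can be pictured as a recursive halving procedure. The sketch below is illustrative only and is not a disclosed protocol: `deconvolve` and `is_active` are hypothetical names, `is_active` stands in for running an assay on a mixture, and the sketch assumes that any subset containing an active compound scores as active on its own (i.e., activity does not depend on synergy between compounds in different subsets).

```python
def deconvolve(pool, is_active):
    """Return the members of `pool` that individually score as active.

    pool      -- list of compound identifiers (hypothetical labels)
    is_active -- callable taking a list of compounds and returning True
                 if the mixture modulates the assay read-out (assumed)
    """
    # An empty or inactive mixture needs no further fractionation.
    if not pool or not is_active(pool):
        return []
    # A single active compound has been isolated.
    if len(pool) == 1:
        return list(pool)
    # Otherwise split the mixture and test each half separately.
    mid = len(pool) // 2
    return deconvolve(pool[:mid], is_active) + deconvolve(pool[mid:], is_active)


# Illustrative use with a simulated assay (hypothetical compound labels)
true_actives = {"cmpd_07"}
pool = ["cmpd_%02d" % i for i in range(16)]
simulated_assay = lambda subset: any(c in true_actives for c in subset)
print(deconvolve(pool, simulated_assay))
```

Halving is only one possible fractionation scheme; chromatographic fractionation typically yields subsets defined by physical properties rather than by list position.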

Lead Optimization and Analog Development and Selection 

In general, novel drugs having therapeutic properties are identified
from libraries, possibly large libraries, of both natural product and synthetic (or
semi-synthetic) extracts or chemical libraries according to methods known in
the art. Those skilled in the field of drug discovery and development will
understand that the precise source of test extracts or compounds is not critical
to the screening procedure(s) of the invention. Accordingly, virtually any
number of chemical extracts or compounds can be screened using the
exemplary methods described herein. Examples of such extracts or
compounds include, but are not limited to, plant-, fungal-, prokaryotic- or
animal-based extracts, fermentation broths, and synthetic compounds, as well
as modifications of existing compounds. Numerous methods are also available
for generating random or directed synthesis (e.g., semi-synthesis or total
synthesis) of any number of chemical compounds, including, but not limited
to, saccharide-, lipid-, peptide-, and nucleic acid-based compounds. Synthetic
compound libraries are commercially available from Brandon Associates
(Merrimack, NH) and Aldrich Chemical (Milwaukee, WI). Alternatively, libraries
of natural compounds in the form of bacterial, fungal, plant, and animal
extracts are commercially available from a number of sources, including
Biotics (Sussex, UK), Xenova (Slough, UK), Harbor Branch Oceanographic
Institution (Ft. Pierce, FL), and PharmaMar, U.S.A. (Cambridge, MA). In
addition, natural and synthetically produced libraries are produced, if desired,
according to methods known in the art, e.g., by standard extraction and
fractionation methods. Furthermore, if desired, any library or compound is
readily modified using standard chemical, physical, or biochemical methods.

De-replication (e.g., taxonomic de-replication, biological de-replication,
and chemical de-replication, or any combination thereof) or the elimination of
replicates or repeats of materials already known for their therapeutic activities
may be employed whenever possible.

When a crude extract is found to have therapeutic activities, further
fractionation of the positive lead extract is possible to isolate the chemical
constituent responsible for the observed effect. Thus, the goal of the
extraction, fractionation, and purification process is the careful
characterization and identification of a chemical entity within the crude extract
having such therapeutic activities. The same assays described herein for the
detection of activities in mixtures of compounds can be used to purify the
active component and to test derivatives thereof. Methods of fractionation and
purification of such heterogeneous extracts are known in the art. If desired,
compounds shown to be useful agents for the treatment of pathogenicity are
chemically modified according to methods known in the art. Compounds
identified as being of therapeutic value are subsequently analyzed using any
standard animal model of an iron metabolism disorder known in the art.

In general, these screening methods provide a ready means for
selecting either natural product extracts or synthetic compounds of interest
from a large population (i.e., a chemical library, for example, one produced by
combinatorial means) which are further evaluated and condensed to a few
active core structures. Multiple analogs of such core structures may be
developed and tested to identify those preferred analogs which have
improved characteristics as therapeutic agents.

Improved analogs may also include compounds with improved stability,
biodistribution, pharmacokinetics or other desirable features for therapeutic
agents which are not directly related to modulation of the therapeutic target.
In a preferred embodiment, the improved analog of the invention is effectively
delivered, either by physiological means or assisted means, to cells of the
body expressing the HFE2A polypeptide. Analog compounds are
systematically screened to evaluate whether they modulate the identified
biological activity and those that effectively do so are then therapeutic agents,
or further analogs thereof, according to the invention.

Therapeutic Agents and Uses Thereof 

Those skilled in the art are familiar with the necessary steps for
pre-clinical and human clinical trials which are used to establish efficacy and
safety of the new chemical entities and compounds first identified by the
invention for use in treating the diseases mentioned herein.

Compounds first identified as useful therapeutic agents using one or
more of the assays of the invention may be administered with a
pharmaceutically-acceptable diluent, carrier, or excipient, in unit dosage form.
Conventional pharmaceutical practice may be employed to provide suitable
formulations or compositions to administer such compositions to patients.
Although oral administration is preferred, any appropriate route of
administration may be employed, for example, intravenous, parenteral,
subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic,
intraventricular, intracapsular, intraspinal, intrathecal, epidural, intracisternal,
intraperitoneal, intranasal, or aerosol administration. Therapeutic formulations
may be in the form of liquid solutions or suspensions; for oral administration,
formulations may be in the form of tablets or capsules; and for intranasal
formulations, in the form of powders, nasal drops, or aerosols.

Methods well known in the art for making formulations are found in, for
example, Remington: The Science and Practice of Pharmacy (19th ed.), ed.
A.R. Gennaro, 1995, Mack Publishing Company, Easton, PA.
Formulations for parenteral administration may, for example, contain
excipients, sterile water, or saline, polyalkylene glycols such as polyethylene
glycol, oils of vegetable origin, or hydrogenated naphthalenes. Biocompatible,
biodegradable lactide polymer, lactide/glycolide copolymer, or
polyoxyethylene-polyoxypropylene copolymers may be used to control the
release of the compounds. Other potentially useful parenteral delivery
systems for agonists of the invention include ethylenevinyl acetate copolymer
particles, osmotic pumps, implantable infusion systems, and liposomes.
Formulations for inhalation may contain excipients, for example, lactose, or
may be aqueous solutions containing, for example, polyoxyethylene-9-lauryl
ether, glycocholate and deoxycholate, or may be oily solutions for
administration in the form of nasal drops, or as a gel.

Combination therapies are also contemplated by the inventors. A
therapeutic agent identified by one of the screening methods disclosed herein
may be administered along with another agent intended to treat a coincident
condition, such as where therapeutic and antitumor agents are given
together or contemporaneously.

The present invention also relates to a process that comprises a
method for producing a product, such as by generating test data to facilitate
identification of such product, comprising identifying an agent according to
one of the disclosed processes for identifying such an agent (i.e., the
therapeutic agents identified according to the assay procedures disclosed
herein) wherein said product is the data collected with respect to said agent
as a result of said identification process, or assay, and wherein said data is
sufficient to convey the chemical character and/or structure and/or properties
of said agent. For example, the present invention specifically contemplates a
situation whereby a user of an assay of the invention may use the assay to
screen for compounds having the desired enzyme modulating activity and,
having identified the compound, then conveys that information (i.e.,
information as to structure, dosage, etc.) to another user who then utilizes the
information to reproduce the agent and administer it for therapeutic or
research purposes according to the invention. For example, the user of the
assay (user 1) may screen a number of test compounds without knowing the
structure or identity of the compounds (such as where a number of code
numbers are used and the first user is simply given samples labeled with said
code numbers) and, after performing the screening process, using one or
more assay processes of the present invention, then imparts to a second user
(user 2), verbally or in writing or some equivalent fashion, sufficient
information to identify the compounds having a particular modulating activity
(for example, the code number with the corresponding results). This
transmission of information from user 1 to user 2 is specifically contemplated
by the present invention.

In accordance with the foregoing, the present invention relates to a 
method for producing test data with respect to the modulation of HFE2A gene 
expression by a compound, comprising: 

(a) contacting a test compound with a genetic construct comprising a
reporter gene operably linked to an HFE2A promoter under conditions
supporting expression of said reporter gene;

(b) determining a change in expression of said reporter gene as a
result of said contacting, wherein said change shows modulation; and

(c) producing test data with respect to the gene modulating activity of
said test compound based on a change in expression of the determined genes
indicating gene modulating activity.

In another such embodiment, the present invention relates to a method
for producing test data with respect to the modulation of an HFE2A
polypeptide activity by a compound, comprising:

(a) contacting a test compound with an HFE2A polypeptide under
conditions promoting an activity of said HFE2A polypeptide;

(b) determining a change in activity of said polypeptide as a result of
said contacting, wherein said change shows modulation; and

(c) producing test data with respect to the HFE2A polypeptide modulating
activity of said test compound based on a change in an HFE2A polypeptide
activity indicating modulating activity.

Diagnostics and Pharmacogenomics

In a further embodiment, the invention relates to diagnostic and
pharmacogenomic compounds, kits and methods. This aspect relates to
analysis of the HFE2A gene (HFE2A, hemojuvelin gene) for the diagnosis,
including prognosis of onset or severity of onset, of juvenile hemochromatosis,
adult onset hemochromatosis, or a disorder of iron metabolism, or in the
selection of a therapeutic agent for a patient (i.e., pharmacogenomics). It also
relates to the use of HFE2A diagnosis to classify patients having or at risk of
having a disease of iron metabolism.

For example, nucleic acid analysis can be used to identify the HFE2A
mutations disclosed herein, thus confirming the diagnosis of juvenile
hemochromatosis.

Using the nucleic acid sequences disclosed in this invention, both the
wild-type (non-disease associated) sequences (SEQ ID Nos. 1 - 9) and the
disease associated (mutated) sequences (SEQ ID Nos. 14, 16, and 18), those
skilled in the art are capable of developing numerous different types of nucleic
acid diagnostic methods, compounds and kits. Techniques include DNA
sequencing, hybridization probing, single stranded conformational analysis,
PCR based techniques such as mismatch amplification, and myriad other well
known methods. All such analyses can be performed on a small sample of
blood, saliva, urine or other tissue provided by the patient.

Alternatively, using the protein sequences disclosed in this invention
(SEQ ID Nos. 10 - 12), protein based analyses such as antibody based assays
(ELISA, radioimmunoassay and the like) can be employed to identify the
expression, amount or presence or absence of a normal or mutant HFE2A
protein (HFE2A polypeptide), such as those mutant polypeptides that result
from the mutations disclosed herein.

Gene transcription, both comparative and absolute, as well as biological
activity, and mutational analysis can each serve as a diagnostic tool for a
disease of iron metabolism; thus determination of the amount of HFE2A
mRNA can be used to diagnose the presence or absence of a mutation
correlated with such a disease.

Based on the instant invention, those skilled in the art will also be able
to develop other biochemical, chemical and diagnostic assays of HFE2A
mutation which are suitable for use with animal tissue samples.

A valuable embodiment of the invention will be to use the diagnostic
assays to classify patients having or at risk of having a disease of iron
metabolism. There are many risk factors for diseases of iron metabolism,
such as enhanced serum ferritin levels, transferrin receptor saturation, or
identification of a mutation in a gene for hemochromatosis (i.e., the Hfe gene).
Not all of these risk factors lead to the development of a disease of iron
metabolism. Using the teaching of the invention, it is now possible to take
patients at risk of having a disease of iron metabolism based on a known risk
factor and further assess them for mutations in HFE2A, wherein a mutation
in HFE2A (in one or both copies of the gene) indicates a statistically greater
chance of developing the disease of iron metabolism. This discovery
particularly aids in diagnosis and prognosis of the risk of adult onset
hemochromatosis, which correlates only loosely with the known mutations in
the Hfe gene.

What follows is an example of the diagnostic benefit of HFE2A in
relation to adult onset hemochromatosis. Nucleic acid or protein analysis used
to identify the HFE2A mutations disclosed herein could be used to predict onset
or severity of adult hemochromatosis. Adult hemochromatosis displays a wide
variability in clinical penetrance. In some human subjects homozygous for
HFE mutations there is early onset of a severe phenotype, while other
subjects homozygous for HFE mutations remain clinically normal.
Assessment of the HFE2A gene may be informative as to which patients will
manifest adult disease and which ones will remain relatively symptom free.

A patient was found to be heterozygous for a mutation in the Hfe1 gene
at H63D. This mutation is considered a mild mutation which does not always
result in onset of hemochromatosis. A second diagnostic test was then
performed on the HFE2A gene. It revealed the following homozygous
mutation:



Mutation in HFE2A      Type    Sequence

c.569C>A, p.A190D      WT      gtccaaggagCttggcctcta (SEQ ID NO. 31)
                       MT      gtccaaggagAttggcctcta (SEQ ID NO. 32)



The patient developed adult onset hemochromatosis. Thus, detection of
mutations in HFE2A is useful in identifying and classifying patients at risk of
developing hemochromatosis because of a known risk factor (such as an Hfe1
mutation), and in determining which patients will develop the disease.

This invention therefore discloses a method of classifying a patient at
risk of developing a disease of iron metabolism, comprising

a) performing a first assessment for a disease of iron metabolism, and

b) performing a diagnostic assay on HFE2A,

wherein a finding of mutation in HFE2A classifies said patient as
having a statistically higher risk of developing said disease.

Pharmacogenomics deals with clinically significant hereditary variations
in the response to drugs due to altered drug disposition and abnormal action
in affected persons (Eichelbaum, M., Clin. Exp. Pharmacol. Physiol., 23:983-985,
1996; Linder, M. W., Clin. Chem., 43:254-266, 1997). In general, two
types of pharmacogenetic conditions can be differentiated: genetic conditions
transmitted as a single factor altering the way drugs act on the body (altered
drug action), and genetic conditions transmitted as single factors altering the
way the body acts on drugs (altered drug metabolism). Altered drug action
may occur in a patient having a polymorphism (e.g., a single nucleotide
polymorphism or SNP) in promoter, intronic, or exonic sequences of the HFE2A
gene. Thus, determining the presence and prevalence of polymorphisms
allows for prediction of a patient's response to a particular therapeutic agent.

This pharmacogenomic analysis can lead to the tailoring of drug
treatments according to patient genotype, including prediction of side effects
upon administration of therapeutic agents, particularly therapeutic agents for
treating disorders disclosed in this specification. Pharmacogenomics allows
for the selection of agents (e.g., drugs) for therapeutic or prophylactic
treatment of an individual based on the genotype of the individual (e.g., the
genotype of the individual is examined to determine the ability of the individual
to respond to a particular agent).

Diagnostics employing a gene or protein corresponding to the HFE2A
gene (HFE2A) can also be useful in selecting patients for clinical trials of a
potential therapeutic agent. Patients can be stratified according to the DNA or
protein sequence of the HFE2A gene and their response to drug treatment can
be evaluated. Such stratification can greatly reduce the number of patients
required to establish efficacy for a potential therapeutic agent.
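
As a purely illustrative sketch of the stratification described above, trial subjects could be grouped by HFE2A genotype and the fraction responding to treatment tabulated for each group. The function name, the genotype labels and the data below are hypothetical assumptions and are not drawn from the specification.

```python
from collections import defaultdict


def response_rate_by_genotype(patients):
    """Group (genotype, responded) pairs and return, for each genotype,
    the fraction of subjects who responded to treatment."""
    counts = defaultdict(lambda: [0, 0])  # genotype -> [responders, total]
    for genotype, responded in patients:
        counts[genotype][1] += 1
        if responded:
            counts[genotype][0] += 1
    return {g: responders / total for g, (responders, total) in counts.items()}


# Hypothetical trial data: (HFE2A genotype label, responded to agent?)
trial = [
    ("wt/wt", True),
    ("wt/wt", True),
    ("wt/mut", True),
    ("wt/mut", False),
    ("mut/mut", False),
]
print(response_rate_by_genotype(trial))
```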

The invention thus comprises compounds, reagents and kits which are 
designed to identify the presence or absence of mutations in the HFE2A 
gene. 

Example 1

Identification of the genetic mutation responsible for juvenile 
hemochromatosis 2A (HFE2A). 

3. We have collected ten Greek families including 13 individuals
with JHH. Pedigrees of these families are shown in Figures 9a-9d. Of these
families (JH3-12), five have been reported previously (JH3-7) (Papanikolaou,
G. et al. Linkage to chromosome 1q in Greek families with juvenile
hemochromatosis. Blood Cells Mol. Dis. 27, 744-749 (2001)). Only one family,
JH7, is known to be consanguineous. We confirmed that the disease is
consistent with linkage to 1q21 (HFE2A; OMIM 602390) in the new families by
using 25 genotype markers, including six new microsatellite markers identified
from genomic sequence. While the Build 31 human genome sequence
assembly contains gaps and duplications, we were able to estimate the size
of the linkage interval, and define the linkage boundaries and gene content
based on existing sequence contigs. Homozygosity mapping in JH7 defined
the limits of the candidate region in an ~1.7 Mb interval between markers
CA3AL590452 (141595604-141595847 bp) and CA1AL596177
(143293543-143293826 bp). (The primer sequences for each marker are in
Table 6.)
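
The approximate size of the candidate interval follows directly from the marker coordinates quoted above. The short calculation below is offered only as an illustrative check; it assumes the interval is measured from the start of the first marker to the end of the second, and the function name is hypothetical.

```python
# Marker coordinates (bp) as quoted in the text for the Build 31 assembly
markers = {
    "CA3AL590452": (141595604, 141595847),
    "CA1AL596177": (143293543, 143293826),
}


def interval_size_bp(left_marker, right_marker):
    """Span from the start of the left marker to the end of the right marker
    (an assumed convention for this illustration)."""
    return right_marker[1] - left_marker[0]


size_bp = interval_size_bp(markers["CA3AL590452"], markers["CA1AL596177"])
print(size_bp, "bp, about", round(size_bp / 1e6, 1), "Mb")
```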

Homozygosity in this region was also observed in affected individuals
from seven families without known consanguinity. We obtained six different
Greek haplotypes by constructing haplotypes using an additional 12 markers.
Affected individuals from families JH3, 5, 6, 7, 10, 11 and 12 shared alleles for
at least 6 of 7 markers in the linked interval. Affected individuals from JH4 and
8 displayed different homozygous haplotypes.



Table 6.

ID No.         Forward Primer                             Reverse Primer

CA3AL590452    cacttgagcccaggaatttg (SEQ ID NO: 55)       gactcactgcagccttgacc (SEQ ID NO: 56)
CA1AL596177    gtgtgctacaagtttgccgaat (SEQ ID NO: 57)     gcttgaaactgggagttgga (SEQ ID NO: 58)
CA1AL355505    gggaaatggtcccataattcct (SEQ ID NO: 59)     cgccctgccAATATGTTCT (SEQ ID NO: 60)
D1S44          GGTACTTAGCCTCGAAATGAGA (SEQ ID NO: 61)     GTGTCACACAACTGGTTGGT (SEQ ID NO: 62)



Identification of Mutations in HFE2A 

Mutations in HFE2A (LOC148738), ENSG00000168509, transcript
ENST00000306561, Ensembl v.12.31.1, November 2002.

We first sequenced one of three coding exons (ENSE00001277351,
v.12.31.1) in this gene. Sequencing was initially limited to those probands that
probably carried unique haplotypes, from families JH3 and 4. The first
mutation identified was in JH4-203, a T to C transition in the 3' exon
(c.842T>C, p.I281T). JH4-203 is homozygous for this SNP. Two siblings and
one child, all unaffected, are heterozygous for this SNP. This SNP was not
present in any other JH probands or family members and was not present in 74
European and 74 Greek control chromosomes.

The second genetic variation identified within this exon is a G to T
transversion (c.959G>T, p.G320V). This SNP was homozygous in the JH3
proband. The JH3 affected is homozygous for the c.959T allele and one
unaffected sib is heterozygous for this SNP. This SNP was not present in 128
European and 37 Greek control chromosomes. After sequencing all family
members of the 10 JH families, it was determined that this SNP is
homozygous in the affected individuals from families JH5, JH6, JH7 and JH10,
and heterozygous in the JH11 proband and both parents of the JH12
proband.

A third variation, a deletion at nucleotide 1079 (c.1079delC,
p.C361fsX366), was identified in the homozygous state in the JH8 proband.
This results in 5 unique amino acids and a premature stop. This variant was
also present in the JH11 proband, indicating that this individual is a compound
heterozygote.
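
The correspondence between the nucleotide and amino acid designations of the two substitutions above can be checked arithmetically. The sketch below is illustrative only; it assumes the conventional numbering in which coding position 1 is the A of the initiator ATG codon, and the function name is hypothetical.

```python
def codon_position(cdna_pos):
    """Map a 1-based coding-sequence position to a pair
    (codon number, position within that codon, 1 to 3)."""
    if cdna_pos < 1:
        raise ValueError("coding positions are numbered from 1")
    return (cdna_pos - 1) // 3 + 1, (cdna_pos - 1) % 3 + 1


# Substitutions reported in the text: nucleotide change -> (position, protein change)
substitutions = {
    "c.842T>C": (842, "p.I281T"),
    "c.959G>T": (959, "p.G320V"),
}
for name, (pos, protein_change) in substitutions.items():
    codon, offset = codon_position(pos)
    print(name, protein_change, "codon", codon, "base", offset)
```

Under this assumption both substitutions fall on the second base of their codons (codons 281 and 320), which agrees with the residue numbers in the protein-level designations.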

Additional information about the sequences disclosed in the SEQ ID listing in
this specification:

LOC148738 Exons

Exons 1, 3a, 3b and 4 may also be found at Gene ENSG00000168509
(Ensembl) and REFSEQ NM_145277.1 (NCBI). Exon 2 may be inferred from
an Ensembl prediction based on the sequence AK098165.1.

To simplify sequence searching, exon sequence coordinates are
provided below based on the NCBI 31 assembly of the human genome. The
SEQ ID NOs for these are:

SEQ ID NO. 1 - Exon1 ENSE00001155188
Start: 142000393, End: 142000541 Chr: 1

SEQ ID NO. 2 - Exon2
Start: 142001790, End: 142001993 Chr: 1

SEQ ID NO. 3 - Exon3a ENSE00001155182
Start: 142002394, End: 142002430 Chr: 1

SEQ ID NO. 4 - Exon3b ENSE00001277320
Start: 142002394, End: 142002953 Chr: 1

SEQ ID NO. 7 - Transcript 2 ENST00000317920
Exons 1, 3a, 4, translation_start: 208

SEQ ID NO. 8 - Transcript 3 Exons 1, 3b, 4 translation_start: 392

SEQ ID NO. 9 - Transcript 4 Exons 1, 2, 3b, 4 translation_start: 257

LOC148738 Translated Proteins

Protein 1 is the protein translation of Transcript 1 and Transcript 2 and
is similar to the Ensembl protein ENSP00000320072 and the NCBI protein
NP_660320.1. Protein 2 is the protein translation of Transcript 2 and may be
found in the protein translation of AK098165.1. Protein 3 is similar to the
predicted Ensembl Protein ENSP00000304614. SEQ ID NOs are as follows:

SEQ ID NO. 10 - Protein 1 ENSP00000320072
Translated from Transcript 1 and Transcript 2

SEQ ID NO. 11 - Protein 2
Translated from Transcript 3

SEQ ID NO. 12 - Protein 3 ENSP00000304614
Translated from Transcript 4

HFE2A Promoter Sequences for Human, Mouse and Rat 

For SEQ ID Nos. 19-21, the promoter sequence is deemed to begin 1500 bp
5' of the transcriptional start site.

The human genomic DNA fragment including the HFE2A promoter region and
exons, and related sequences, have SEQ ID NOs as follows:

SEQ ID NO. 22 - Human Genomic DNA fragment including HFE2A
promoter region and exons

SEQ ID NO. 23 - Similar to gi|24308189|ref|NP_064596.1| Human RGM

SEQ ID NO. 24 - Similar to ENSESTP00000023393 Human HFE2A
paralog on chromosome 5, sequence starting from first methionine

SEQ ID NO. 25 - Similar to ENSMUSESTP00000016634 Mouse HFE2A

SEQ ID NO. 26 - Similar to gi|21758120|dbj|BAC05248.1| Rat HFE2A,
annotated as from Homo sapiens but nucleotide sequence identified as
from rat

SEQ ID NO. 27 - Similar to SINFRUP00000138308 Fugu HFE2A,
N-terminus not defined

SEQ ID NO. 28 - Similar to gi|22651834|gb|AAM95449.1| Chicken RGM

Those skilled in the art will be familiar with numerous variations of the
compositions and methods disclosed above, all of which are encompassed by
the claims further set out below.

WHAT IS CLAIMED IS: 

1. A method for identifying an agent that modulates HFE2A gene 
expression, comprising: 

(a) contacting a test compound with a genetic construct comprising a
reporter gene operably linked to an HFE2A promoter under conditions
supporting expression of said reporter gene;

(b) determining a change in expression of said reporter gene as a 
result of said contacting, 

wherein a change in expression identifies said test compound as an 
agent that modulates HFE2A gene expression. 

2. The method of claim 1 wherein said modulation is an increase in 
expression of said reporter gene. 

3. The method of claim 1 wherein said modulation is a decrease in
expression of said reporter gene.

4. The method of claim 1 wherein said HFE2A promoter is a 
mammalian HFE2A promoter. 

5. The method of claim 4 wherein said mammal is a member selected 
from the group consisting of mouse, rat and human. 

6. The method of claim 1 wherein said promoter has the nucleotide 
sequence of SEQ ID NO: 19. 

7. The method of claim 1 wherein said genetic construct is in a cell. 

8. The method of claim 7 wherein said cell is a mammalian cell. 

9. The method of claim 7 wherein said cell is a recombinant cell
engineered to express said gene and wherein said cell does not express said
gene absent said engineering.

10. The method of claim 8 wherein said mammalian cell is a
macrophage, inflammatory cell, liver cell, hepatocyte, intestinal cell,
hematopoietic cell, pancreatic cell, skeletal muscle cell or a nervous system
cell.



11. The method of claim 1 wherein said test compound is useful in
controlling iron metabolism. 

12. The method of claim 1 wherein said test compound is first identified 
as an agent that modulates HFE2A gene expression using said method. 

13. The method of claim 1 wherein said change is determined by
measuring a change in an enzyme activity.

14. A method for identifying an agent that modulates HFE2A gene 
expression, comprising: 

a) contacting a test compound with a cell expressing an HFE2A gene;

b) determining a change in expression of said HFE2A gene as a result
of said contacting,

wherein a change in expression identifies said test compound as an 
agent that modulates HFE2A gene expression. 

15. A method for identifying an agent that modulates the activity of an
HFE2A polypeptide, comprising:

a) contacting a test compound with an HFE2A polypeptide under 
conditions supporting an activity of said polypeptide; and 

b) determining a change in said activity of said polypeptide as a result 
of said contacting, 

wherein a change in the activity identifies said test compound as an
agent that modulates the activity of an HFE2A polypeptide.

16. The method of claim 15 wherein said HFE2A polypeptide is a
mammalian HFE2A polypeptide. 

17. The method of claim 16 wherein said mammal is a member
selected from the group consisting of mouse, rat and human.

18. The method of claim 15 wherein said change is a decrease in said 
activity. 

19. The method of claim 15 wherein said change is an increase in said 
activity. 

20. The method of claim 15 wherein said polypeptide is part of a cell.

21. The method of claim 20 wherein said cell is a mammalian cell.

22. The method of claim 20 wherein said cell has been engineered to
contain said polypeptide.

23. The method of claim 22 wherein said cell was engineered by 
genetic engineering. 

24. The method of claim 23 wherein said cell does not contain said
polypeptide absent said engineering.

25. The method of claim 20 wherein said cell is a macrophage,
inflammatory cell, liver cell, hepatocyte, intestinal cell, hematopoietic cell,
pancreatic cell, skeletal muscle cell or a cell of the nervous system.

26. The method of claim 15 wherein said HFE2A polypeptide is highly
purified.

27. The method of claim 15 wherein said HFE2A polypeptide is a semi- 
purified polypeptide. 

28. The method of claim 15 wherein said HFE2A polypeptide is
attached to a solid surface.

29. The method of claim 28 wherein said solid surface is a resin. 

30. The method of claim 15 wherein said HFE2A polypeptide is part of 
an extract. 

31. The method of claim 15 wherein said polypeptide is encoded by a
polynucleotide having the sequence selected from the group consisting of
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8 and 9.

32. The method of claim 15 wherein said polypeptide comprises an 
amino acid sequence selected from SEQ ID NO: 10, 11 and 12. 

33. A method for treating a disorder comprising administering to an
animal afflicted therewith a therapeutically effective amount of an HFE2A
modulator.

34. The method of claim 33 wherein said HFE2A modulator exhibits 
modulating activity in a method of claim 1, 14 or 15.

35. The method of claim 34 wherein said modulator was first identified
as an HFE2A modulator using said method.

36. The method of claim 33 wherein said HFE2A modulator is a
member selected from the group consisting of a selective HFE2A agonist, a
selective HFE2A antagonist, pharmaceutically acceptable salts thereof, and
combinations of these.

37. The method of claim 36 wherein said member is in a
pharmaceutically acceptable carrier.

38. The method of claim 33 wherein said HFE2A modulator is selected 
from an antibody, an anti-sense molecule, a ribozyme and a drug-like small 
organic molecule. 

39. The method of claim 33 wherein said disorder is a disease of iron
metabolism.

40. The method of claim 39 wherein said disease of iron metabolism is
a member selected from the group consisting of an iron overload disorder and
an iron deficiency disorder.

41. The method of claim 39 wherein said disease of iron metabolism is
a member selected from the group consisting of hemochromatosis,
transfusion iron overload, thalassemia, porphyria, and juvenile
hemochromatosis.

42. The method of claim 39 wherein said disease of iron metabolism is
a member selected from the group consisting of anemia and anemia of
chronic disease.

43. The method of claim 39 wherein said disease of iron metabolism is 
a member selected from Type I diabetes, Type II diabetes and insulin 
resistance. 

44. A method for preventing a disorder comprising administering to an
animal, at risk of developing said disorder, a therapeutically effective amount
of an HFE2A modulator.

45. The method of claim 44 wherein said HFE2A modulator exhibits
modulating activity in a method of claim 1, 14 or 15.

46. The method of claim 45 wherein said modulator was first identified
as an HFE2A modulator using said method.

47. The method of claim 44 wherein said HFE2A modulator is a
member selected from the group consisting of a selective HFE2A agonist, a
selective HFE2A antagonist, pharmaceutically acceptable salts thereof, and
combinations of these.

48. The method of claim 47 wherein said member is in a
pharmaceutically acceptable carrier.

49. The method of claim 44 wherein said HFE2A modulator is selected 
from an antibody, an anti-sense molecule, a ribozyme and a drug-like small 
organic molecule. 

50. The method of claim 44 wherein said disorder is a disease of iron
metabolism.

51. The method of claim 50 wherein said disease of iron metabolism is
a member selected from the group consisting of an iron overload disorder and
an iron deficiency disorder.

52. The method of claim 50 wherein said disease of iron metabolism is
a member selected from the group consisting of hemochromatosis,
transfusion iron overload, thalassemia, porphyria, and juvenile
hemochromatosis.

53. The method of claim 50 wherein said disease of iron metabolism is
a member selected from the group consisting of anemia and anemia of
chronic disease.

54. The method of claim 50 wherein said disease of iron metabolism is
a member selected from Type I diabetes, Type II diabetes and insulin
resistance.

55. A method to diagnose individuals affected by or at risk of
developing a disease of iron metabolism comprising determining a mutation or
polymorphism in the nucleic acid sequence of the HFE2A gene in said
individual wherein said mutation or polymorphism in said gene identifies said
individual as being affected by or at risk of developing a disease of iron
metabolism.

56. The method of claim 55 wherein said mutation or polymorphism is
a mutation or polymorphism of Table 1.

57. A method to diagnose individuals affected by or at risk of
developing a disease of iron metabolism comprising determining a mutation or
polymorphism in the amino acid sequence of HFE2A polypeptide in said
individual wherein a mutation or polymorphism of said HFE2A polypeptide
identifies said individual as being affected by or at risk of developing a
disease of iron metabolism.

58. A method to diagnose individuals affected by or at risk of
developing a disease of iron metabolism comprising determining a decrease
in level or amount of an HFE2A polypeptide in said individual wherein said
decrease identifies said individual as being affected by or at risk of developing
a disease of iron metabolism.

59. The method of claim 55, 56, 57 or 58 wherein said disease of iron
metabolism is adult onset hemochromatosis.

60. The method of claim 55, 56, 57 or 58 wherein said disease of iron 
metabolism is juvenile hemochromatosis. 

61. The method of claim 55, 56, 57 or 58 wherein said disease of iron
metabolism is selected from Type I diabetes, Type II diabetes and insulin
resistance.

62. A method for identifying a compound capable of modulating a
HFE2A activity, comprising: (a) contacting a test compound with a cell that
expresses HFE2A; and (b) assaying the ability of said test compound to
modulate the transcription of a HFE2A nucleic acid or the activity of HFE2A
polypeptide, thereby identifying a compound capable of modulating a HFE2A
activity.

63. The method of claim 62, wherein the compound is an anti-HFE2A
polypeptide antibody.

64. The method of claim 62, wherein the compound is an antisense
HFE2A nucleic acid molecule.

65. The method of claim 62, wherein the compound is a HFE2A
ribozyme. 

66. The method of claim 62, wherein the compound is a small organic 
molecule. 

67. An isolated polynucleotide comprising a polynucleotide having a
nucleotide sequence with at least 60 percent identity to a sequence selected
from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 19, 20, 21
and 22.

68. The isolated polynucleotide of claim 67 wherein said percent
identity is at least 70 percent.

69. The isolated polynucleotide of claim 67 wherein said percent 
identity is at least 78 percent. 

70. The isolated polynucleotide of claim 67 wherein said percent
identity is at least 90 percent.

71. The isolated polynucleotide of claim 67 wherein said percent
identity is at least 95 percent.

72. The isolated polynucleotide of claim 67 wherein said percent 
identity is at least 98 percent. 

73. The isolated polynucleotide of claim 67 wherein said nucleotide
sequence is a member selected from the group consisting of SEQ ID NO: 1,
2, 3, 4, 5, 6, 7, 8, 9, 19, 20, 21 and 22.

74. An isolated polypeptide comprising a polypeptide having an amino
acid sequence with at least 90 percent identity to an amino acid sequence
selected from the group consisting of SEQ ID NO: 10, 11, 12, 23, 24, 25, 26,
27 and 28.

75. The isolated polypeptide of claim 74 wherein said percent identity is
at least 95 percent.

76. The isolated polypeptide of claim 74 wherein said percent identity is
at least 98 percent.

77. The isolated polypeptide of claim 74 wherein said amino acid
sequence is selected from the group consisting of SEQ ID NO: 10, 11, 12, 23,
24, 25, 26, 27 and 28.

78. A cell line comprising a recombinant form of the polynucleotide of
claim 67.

79. A cell line comprising a recombinant form of the polypeptide of
claim 74.

80. Use of a cell line of claim 78 or 79 in a method for identifying 
compositions which modulate expression or activity of hemojuvelin. 

81. A composition for treating a disease of iron metabolism comprising
a therapeutically effective amount of a polypeptide of claim 74 in a
pharmaceutically acceptable carrier.

82. A method for treating a disease of iron metabolism comprising
administering to a patient in need thereof a therapeutically effective amount of
a composition of claim 81 in a pharmaceutically acceptable carrier.

83. The method according to claim 82, wherein said disease is juvenile
hemochromatosis.

84. A method for producing test data with respect to the modulation of 
HFE2A gene expression by a compound, comprising: 

(a) a genetic construct comprising a reporter gene operably linked to
an HFE2A promoter under conditions supporting expression of said reporter
gene;

(b) determining a change in expression of said reporter gene as a
result of said contacting, wherein said change shows modulation, and

(c) producing test data with respect to the gene modulating activity of
said test compound based on a change in expression of the determined genes
indicating gene modulating activity.

85. A method for producing test data with respect to the modulation of 
an HFE2A polypeptide activity by a compound, comprising: 

(a) contacting a test compound with an HFE2A polypeptide under
conditions promoting an activity of said HFE2A polypeptide;

(b) determining a change in activity of said polypeptide as a result of
said contacting, wherein said change shows modulation, and

(c) producing test data with respect to the HFE2A polypeptide modulating
activity of said test compound based on a change in an HFE2A polypeptide
activity indicating modulating activity.
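
Claims 67 to 77 recite percent-identity thresholds (60, 70, 78, 90, 95 and 98 percent). The following is a minimal sketch of how such a threshold could be checked, assuming the two sequences have already been aligned to equal length and that identity is counted as matching columns divided by alignment length, with gap columns never counted as matches. This section does not specify an alignment program or scoring convention, so both of those choices are assumptions; the function names are illustrative only. The example inputs are the 21-nucleotide primers of SEQ ID NO: 15 and 16.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Assumes the inputs are already aligned (equal length, '-' for gaps).
    A gap column is never counted as a match (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    matches = sum(1 for x, y in pairs if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True when the percent identity is at least the recited threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# SEQ ID NO: 15 and SEQ ID NO: 16, with the listing's spacing removed.
seq15 = "tgtgttggggggtgccctcca"
seq16 = "tgtgttggggtgtgccctcca"

identity = percent_identity(seq15, seq16)
print(f"{identity:.2f} percent identity")
print("at least 95 percent:", meets_threshold(seq15, seq16, 95))
print("at least 98 percent:", meets_threshold(seq15, seq16, 98))
```

Under these assumptions the two primers match at 20 of 21 positions, so they satisfy a 95 percent threshold but not a 98 percent threshold.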

FIGURES 8A and 8B

[Figure: nucleotide sequence of the HFE2A coding region, numbered in rows of 60 from position 1, with the deduced amino acid sequence (residues 1 to 426, followed by a stop codon) shown beneath each row. The sequence characters are not legible in this reproduction; the legible portions of the translation appear to correspond to SEQ ID NO: 12 of the Sequence Listing.]

SEQUENCE LISTING



<110> Xenon Genetics Inc. 

<120> Juvenile Hemochromatosis Gene (HFE2A), Expression Products
and Uses Thereof 

<130> 836-115PCT 

<140> N/A 

<141> 2004-04-08 

<150> 60/461,615 
<151> 2003-04-09 

<150> 60/462,867 
<151> 2003-04-15 

<150> 60/488,607 
<151> 2003-07-18 

<150> 60/498,458 
<151> 2003-08-28 

<160> 62 

<170> PatentIn version 3.0

<210> 1 

<211> 149 

<212> DNA

<213> Homo sapiens 

<400> 1 

cttctctggt tccctgacct cagtgagaca gcagccggcc tggggacctg ggggagacac 60 

ggaggacccc ctggctggag ctgacccaca gagtagggaa tcatggctgg agaattggat 120 

agcagagtaa tgtttgacct ctggaaaca 149 
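
Because each record declares its length in the <211> field, a transcribed record can be checked against that declaration. The sketch below is one way to do so, assuming nucleotide rows are written as lowercase letter groups followed by a running position count; the helper names are hypothetical and are not part of the listing or of any listing software. It is shown on SEQ ID NO: 1 as transcribed above.

```python
import re


def count_residues(sequence_lines):
    """Count nucleotide letters, ignoring spaces and running position counts."""
    total = 0
    for line in sequence_lines:
        # Only nucleotide symbols are counted; digits and spaces are skipped.
        total += len(re.findall(r"[acgtun]", line.lower()))
    return total


def matches_declared_length(declared_length, sequence_lines):
    """Compare the <211> value with the number of residues actually present."""
    return count_residues(sequence_lines) == declared_length


# SEQ ID NO: 1 as it appears in the listing, <211> 149.
seq_id_1 = [
    "cttctctggt tccctgacct cagtgagaca gcagccggcc tggggacctg ggggagacac 60",
    "ggaggacccc ctggctggag ctgacccaca gagtagggaa tcatggctgg agaattggat 120",
    "agcagagtaa tgtttgacct ctggaaaca 149",
]

residues = count_residues(seq_id_1)
print(residues, matches_declared_length(149, seq_id_1))
```

A mismatch between the counted residues and the <211> value would point to a transcription error in that record rather than to a property of the sequence itself.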



<210> 2 

<211> 204 

<212> DNA

<213> Homo sapiens 

<400> 2 

ccaaatttct tttttcagtc acttacaggg cttccggtca aaattcacta ggtaggaggg 60

tcatcagctg ggaagaaccg gcgcctggga aacctggctg gataggtatg ggggagccag 120

gccagtcccc tagtcccagg tcctcccatg gcagtccccc aactctaagc actctcactc 180

tcctgctgct cctctgtgga catg 204



<210> 3 

<211> 37 

<212> DNA 

<213> Homo sapiens 

<400> 3

ctcattctca atgcaagatc ctccgctgca atgctga 37



<210> 4 

<211> 560 

<212> DNA 

<213> Homo sapiens 



<400> 4 

ctcattctca atgcaagatc ctccgctgca atgctgagta cgtatcgtcc actctgagcc 60

ttagaggtgg gggttcatca ggagcacttc gaggaggagg aggaggaggc cggggtggag 120

gggtgggctc tggcggcctc tgtcgagccc tccgctccta tgcgctctgc actcggcgca 180

ccgcccgcac ctgccgcggg gacctcgcct tccattcggc ggtacatggc atcgaagacc 240

tgatgatcca gcacaactgc tcccgccagg gccctacagc ccctcccccg ccccggggcc 300

ccgcccttcc aggcgcgggc tccggcctcc ctgccccgga cccttgtgac tatgaaggcc 360

ggttttcccg gctgcatggt cgtcccccgg ggttcttgca ttgcgcttcc ttcggggacc 420

cccatgtgcg cagcttccac catcactttc acacatgccg tgtccaagga gcttggcctc 480

tactggataa tgacttcctc tttgtccaag ccaccagctc ccccatggcg ttgggggcca 540

acgctaccgc cacccggaag 560



<210> 5 

<211> 1230 

<212> DNA

<213> Homo sapiens 



<400> 5 

ctcaccatca tatttaagaa catgcaggaa tgcattgatc agaaggtgta tcaggctgag 60

gtggataatc ttcctgtagc ctttgaagat ggttctatca atggaggtga ccgacctggg 120

ggatccagtt tgtcgattca aactgctaac cctgggaacc atgtggagat ccaagctgcc 180

tacattggca caactataat cattcggcag acagctgggc agctctcctt ctccatcaag 240

gtagcagagg atgtggccat ggccttctca gctgaacagg acctgcagct ctgtgttggg 300

gggtgccctc caagtcagcg actctctcga tcagagcgca atcgtcgggg agctataacc 360

attgatactg ccagacggct gtgcaaggaa gggcttccag tggaagatgc ttacttccat 420

tcctgtgtct ttgatgtttt aatttctggt gatcccaact ttaccgtggc agctcaggca 480

gcactggagg atgcccgagc cttcctgcca gacttagaga agctgcatct cttcccctca 540

gatgctgggg ttcctctttc ctcagcaacc ctcttagctc cactcctttc tgggctcttt 600

gttctgtggc tttgcattca gtaaggggac catcagtccc attactagtt tggaaatgat 660

ttggagatac agattggcat agaagaatgt aaagaatcat taaaggaagc agggcctagg 720

agacacgtga aacaatgaca ttatccagag tcagatgagg ctgcagtcca gggttgaaat 780

tatcacagaa taaggattct gggcaaggtt actgcattcc ggatctctgt ggggctcttc 840

accaattttt ccagcctcat ttatagtaaa caaattgttc taatccattt actgcagatt 900

tcacccttat aagtttagag gtcatgaagg ttttaatgat cagtaaagat ttaagggttg 960

agatttttaa gaggcaagag ctgaaagcag aagacatgat cattagccat aagaaactca 1020

aaggaggaag acataattag ggaaagaagt ctatttgatg aatatgtgtg tgtaaggtat 1080

gttctgcttt cttgattcaa aaatgaagca ggcattgtct agctcttagg tgaagggagt 1140

ctctgctttt gaagaatggc acaggtagga cagaagtatc atccctaccc cctaactaat 1200

ctgttattaa agctacaaat tcttcacacc 1230



<210> 6 

<211> 1379 

<212> DNA 

<213> Homo sapiens 



<400> 6 

cttctctggt tccctgacct cagtgagaca gcagccggcc tggggacctg ggggagacac 60

ggaggacccc ctggctggag ctgacccaca gagtagggaa tcatggctgg agaattggat 120 

agcagagtaa tgtttgacct ctggaaacac tcaccatcat atttaagaac atgcaggaat 180 

gcattgatca gaaggtgtat caggctgagg tggataatct tcctgtagcc tttgaagatg 240 

gttctatcaa tggaggtgac cgacctgggg gatccagttt gtcgattcaa actgctaacc 300

ctgggaacca tgtggagatc caagctgcct acattggcac aactataatc attcggcaga 360 

cagctgggca gctctccttc tccatcaagg tagcagagga tgtggccatg gccttctcag 420 

ctgaacagga cctgcagctc tgtgttgggg ggtgccctcc aagtcagcga ctctctcgat 480 

cagagcgcaa tcgtcgggga gctataacca ttgatactgc cagacggctg tgcaaggaag 540 

ggcttccagt ggaagatgct tacttccatt cctgtgtctt tgatgtttta atttctggtg 600

atcccaactt taccgtggca gctcaggcag cactggagga tgcccgagcc ttcctgccag 660

acttagagaa gctgcatctc ttcccctcag atgctggggt tcctctttcc tcagcaaccc 720 

tcttagctcc actcctttct gggctctttg ttctgtggct ttgcattcag taaggggacc 780 

atcagtccca ttactagttt ggaaatgatt tggagataca gattggcata gaagaatgta 840 

aagaatcatt aaaggaagca gggcctagga gacacgtgaa acaatgacat tatccagagt 900 

cagatgaggc tgcagtccag ggttgaaatt atcacagaat aaggattctg ggcaaggtta 960 

ctgcattccg gatctctgtg gggctcttca ccaatttttc cagcctcatt tatagtaaac 1020

aaattgttct aatccattta ctgcagattt cacccttata agtttagagg tcatgaaggt 1080 

tttaatgatc agtaaagatt taagggttga gatttttaag aggcaagagc tgaaagcaga 1140 

agacatgatc attagccata agaaactcaa aggaggaaga cataattagg gaaagaagtc 1200 

tatttgatga atatgtgtgt gtaaggtatg ttctgctttc ttgattcaaa aatgaagcag 1260 

gcattgtcta gctcttaggt gaagggagtc tctgcttttg aagaatggca caggtaggac 1320 

agaagtatca tccctacccc ctaactaatc tgttattaaa gctacaaatt cttcacacc 1379 



<210> 7 

<211> 1416 

<212> DNA 

<213> Homo sapiens 



<400> 7 

cttctctggt tccctgacct cagtgagaca gcagccggcc tggggacctg ggggagacac 60 

ggaggacccc ctggctggag ctgacccaca gagtagggaa tcatggctgg agaattggat 120 

agcagagtaa tgtttgacct ctggaaacac tcattctcaa tgcaagatcc tccgctgcaa 180 

tgctgactca ccatcatatt taagaacatg caggaatgca ttgatcagaa ggtgtatcag 240 

gctgaggtgg ataatcttcc tgtagccttt gaagatggtt ctatcaatgg aggtgaccga 300 

cctgggggat ccagtttgtc gattcaaact gctaaccctg ggaaccatgt ggagatccaa 360 

gctgcctaca ttggcacaac tataatcatt cggcagacag ctgggcagct ctccttctcc 420 

atcaaggtag cagaggatgt ggccatggcc ttctcagctg aacaggacct gcagctctgt 480 

gttggggggt gccctccaag tcagcgactc tctcgatcag agcgcaatcg tcggggagct 540 

ataaccattg atactgccag acggctgtgc aaggaagggc ttccagtgga agatgcttac 600 

ttccattcct gtgtctttga tgttttaatt tctggtgatc ccaactttac cgtggcagct 660 

caggcagcac tggaggatgc ccgagccttc ctgccagact tagagaagct gcatctcttc 720 

ccctcagatg ctggggttcc tctttcctca gcaaccctct tagctccact cctttctggg 780 

ctctttgttc tgtggctttg cattcagtaa ggggaccatc agtcccatta ctagtttgga 840 

aatgatttgg agatacagat tggcatagaa gaatgtaaag aatcattaaa ggaagcaggg 900 

cctaggagac acgtgaaaca atgacattat ccagagtcag atgaggctgc agtccagggt 960 

tgaaattatc acagaataag gattctgggc aaggttactg cattccggat ctctgtgggg 1020 

ctcttcacca atttttccag cctcatttat agtaaacaaa ttgttctaat ccatttactg 1080 

cagatttcac ccttataagt ttagaggtca tgaaggtttt aatgatcagt aaagatttaa 1140 

gggttgagat ttttaagagg caagagctga aagcagaaga catgatcatt agccataaga 1200 

aactcaaagg aggaagacat aattagggaa agaagtctat ttgatgaata tgtgtgtgta 1260 

aggtatgttc tgctttcttg attcaaaaat gaagcaggca ttgtctagct cttaggtgaa 1320 

gggagtctct gcttttgaag aatggcacag gtaggacaga agtatcatcc ctacccccta 1380 

actaatctgt tattaaagct acaaattctt cacacc 1416 

<210> 8

<211> 1939 

<212> DNA 

<213> Homo sapiens 



<400> 8 

cttctctggt tccctgacct cagtgagaca gcagccggcc tggggacctg ggggagacac 60 

ggaggacccc ctggctggag ctgacccaca gagtagggaa tcatggctgg agaattggat 120

agcagagtaa tgtttgacct ctggaaacac tcattctcaa tgcaagatcc tccgctgcaa 180

tgctgagtac gtatcgtcca ctctgagcct tagaggtggg ggttcatcag gagcacttcg 240 

aggaggagga ggaggaggcc ggggtggagg ggtgggctct ggcggcctct gtcgagccct 300 

ccgctcctat gcgctctgca ctcggcgcac cgcccgcacc tgccgcgggg acctcgcctt 360

ccattcggcg gtacatggca tcgaagacct gatgatccag cacaactgct cccgccaggg 420

ccctacagcc cctcccccgc cccggggccc cgcccttcca ggcgcgggct ccggcctccc 480

tgccccggac ccttgtgact atgaaggccg gttttcccgg ctgcatggtc gtcccccggg 540

gttcttgcat tgcgcttcct tcggggaccc ccatgtgcgc agcttccacc atcactttca 600

cacatgccgt gtccaaggag cttggcctct actggataat gacttcctct ttgtccaagc 660 

caccagctcc cccatggcgt tgggggccaa cgctaccgcc acccggaagc tcaccatcat 720 

atttaagaac atgcaggaat gcattgatca gaaggtgtat caggctgagg tggataatct 780 

tcctgtagcc tttgaagatg gttctatcaa tggaggtgac cgacctgggg gatccagttt 840 

gtcgattcaa actgctaacc ctgggaacca tgtggagatc caagctgcct acattggcac 900 

aactataatc attcggcaga cagctgggca gctctccttc tccatcaagg tagcagagga 960 

tgtggccatg gccttctcag ctgaacagga cctgcagctc tgtgttgggg ggtgccctcc 1020 

aagtcagcga ctctctcgat cagagcgcaa tcgtcgggga gctataacca ttgatactgc 1080 

cagacggctg tgcaaggaag ggcttccagt ggaagatgct tacttccatt cctgtgtctt 1140 

tgatgtttta atttctggtg atcccaactt taccgtggca gctcaggcag cactggagga 1200 

tgcccgagcc ttcctgccag acttagagaa gctgcatctc ttcccctcag atgctggggt 1260 

tcctctttcc tcagcaaccc tcttagctcc actcctttct gggctctttg ttctgtggct 1320 

ttgcattcag taaggggacc atcagtccca ttactagttt ggaaatgatt tggagataca 1380

gattggcata gaagaatgta aagaatcatt aaaggaagca gggcctagga gacacgtgaa 1440 

acaatgacat tatccagagt cagatgaggc tgcagtccag ggttgaaatt atcacagaat 1500 

aaggattctg ggcaaggtta ctgcattccg gatctctgtg gggctcttca ccaatttttc 1560 

cagcctcatt tatagtaaac aaattgttct aatccattta ctgcagattt cacccttata 1620 

agtttagagg tcatgaaggt tttaatgatc agtaaagatt taagggttga gatttttaag 1680 

aggcaagagc tgaaagcaga agacatgatc attagccata agaaactcaa aggaggaaga 1740 

cataattagg gaaagaagtc tatttgatga atatgtgtgt gtaaggtatg ttctgctttc 1800 

ttgattcaaa aatgaagcag gcattgtcta gctcttaggt gaagggagtc tctgcttttg 1860 

aagaatggca caggtaggac agaagtatca tccctacccc ctaactaatc tgttattaaa 1920 

gctacaaatt cttcacacc 1939 



<210> 9 

<211> 2143 

<212> DNA 

<213> Homo sapiens 

<400> 9 

cttctctggt tccctgacct cagtgagaca gcagccggcc tggggacctg ggggagacac 60 

ggaggacccc ctggctggag ctgacccaca gagtagggaa tcatggctgg agaattggat 120 

agcagagtaa tgtttgacct ctggaaacac caaatttctt ttttcagtca cttacagggc 180 

ttccggtcaa aattcactag gtaggagggt catcagctgg gaagaaccgg cgcctgggaa 240 

acctggctgg ataggtatgg gggagccagg ccagtcccct agtcccaggt cctcccatgg 300 

cagtccccca actctaagca ctctcactct cctgctgctc ctctgtggac atgctcattc 360 

tcaatgcaag atcctccgct gcaatgctga gtacgtatcg tccactctga gccttagagg 420 

tgggggttca tcaggagcac ttcgaggagg aggaggagga ggccggggtg gaggggtggg 480 
ctctggcggc ctctgtcgag ccctccgctc ctatgcgctc tgcactcggc gcaccgcccg 540

cacctgccgc ggggacctcg ccttccattc ggcggtacat ggcatcgaag acctgatgat 600 

ccagcacaac tgctcccgcc agggccctac agcccctccc ccgccccggg gccccgccct 660 

tccaggcgcg ggctccggcc tccctgcccc ggacccttgt gactatgaag gccggttttc 720 

ccggctgcat ggtcgtcccc cggggttctt gcattgcgct tccttcgggg acccccatgt 780 

gcgcagcttc caccatcact ttcacacatg ccgtgtccaa ggagcttggc ctctactgga 840 

taatgacttc ctctttgtcc aagccaccag ctcccccatg gcgttggggg ccaacgctac 900 

cgccacccgg aagctcacca tcatatttaa gaacatgcag gaatgcattg atcagaaggt 960 

gtatcaggct gaggtggata atcttcctgt agcctttgaa gatggttcta tcaatggagg 1020 

tgaccgacct gggggatcca gtttgtcgat tcaaactgct aaccctggga accatgtgga 1080 

gatccaagct gcctacattg gcacaactat aatcattcgg cagacagctg ggcagctctc 1140 

cttctccatc aaggtagcag aggatgtggc catggccttc tcagctgaac aggacctgca 1200 

gctctgtgtt ggggggtgcc ctccaagtca gcgactctct cgatcagagc gcaatcgtcg 1260 

gggagctata accattgata ctgccagacg gctgtgcaag gaagggcttc cagtggaaga 1320 

tgcttacttc cattcctgtg tctttgatgt tttaatttct ggtgatccca actttaccgt 1380 

ggcagctcag gcagcactgg aggatgcccg agccttcctg ccagacttag agaagctgca 1440 

tctcttcccc tcagatgctg gggttcctct ttcctcagca accctcttag ctccactcct 1500 

ttctgggctc tttgttctgt ggctttgcat tcagtaaggg gaccatcagt cccattacta 1560 

gtttggaaat gatttggaga tacagattgg catagaagaa tgtaaagaat cattaaagga 1620 

agcagggcct aggagacacg tgaaacaatg acattatcca gagtcagatg aggctgcagt 1680 

ccagggttga aattatcaca gaataaggat tctgggcaag gttactgcat tccggatctc 1740 

tgtggggctc ttcaccaatt tttccagcct catttatagt aaacaaattg ttctaatcca 1800

tttactgcag atttcaccct tataagttta gaggtcatga aggttttaat gatcagtaaa 1860 

gatttaaggg ttgagatttt taagaggcaa gagctgaaag cagaagacat gatcattagc 1920 

cataagaaac tcaaaggagg aagacataat tagggaaaga agtctatttg atgaatatgt 1980 

gtgtgtaagg tatgttctgc tttcttgatt caaaaatgaa gcaggcattg tctagctctt 2040 

aggtgaaggg agtctctgct tttgaagaat ggcacaggta ggacagaagt atcatcccta 2100 

ccccctaact aatctgttat taaagctaca aattcttcac acc 2143 



<210> 10

<211> 200 

<212> PRT 

<213> Homo sapiens 



<400> 10 

Met Gln Glu Cys Ile Asp Gln Lys Val Tyr Gln Ala Glu Val Asp Asn
1 5 10 15

Leu Pro Val Ala Phe Glu Asp Gly Ser Ile Asn Gly Gly Asp Arg Pro
20 25 30

Gly Gly Ser Ser Leu Ser Ile Gln Thr Ala Asn Pro Gly Asn His Val
35 40 45

Glu Ile Gln Ala Ala Tyr Ile Gly Thr Thr Ile Ile Ile Arg Gln Thr
50 55 60

Ala Gly Gln Leu Ser Phe Ser Ile Lys Val Ala Glu Asp Val Ala Met
65 70 75 80

Ala Phe Ser Ala Glu Gln Asp Leu Gln Leu Cys Val Gly Gly Cys Pro
85 90 95

Pro Ser Gln Arg Leu Ser Arg Ser Glu Arg Asn Arg Arg Gly Ala Ile
100 105 110

Thr Ile Asp Thr Ala Arg Arg Leu Cys Lys Glu Gly Leu Pro Val Glu
115 120 125

Asp Ala Tyr Phe His Ser Cys Val Phe Asp Val Leu Ile Ser Gly Asp
130 135 140

Pro Asn Phe Thr Val Ala Ala Gln Ala Ala Leu Glu Asp Ala Arg Ala
145 150 155 160

Phe Leu Pro Asp Leu Glu Lys Leu His Leu Phe Pro Ser Asp Ala Gly
165 170 175

Val Pro Leu Ser Ser Ala Thr Leu Leu Ala Pro Leu Leu Ser Gly Leu
180 185 190

Phe Val Leu Trp Leu Cys Ile Gln
195 200



<210> 11 

<211> 313 

<212> PRT 

<213> Homo sapiens 

<400> 11 

Met Ile Gln His Asn Cys Ser Arg Gln Gly Pro Thr Ala Pro Pro Pro
1 5 10 15

Pro Arg Gly Pro Ala Leu Pro Gly Ala Gly Ser Gly Leu Pro Ala Pro
20 25 30

Asp Pro Cys Asp Tyr Glu Gly Arg Phe Ser Arg Leu His Gly Arg Pro
35 40 45

Pro Gly Phe Leu His Cys Ala Ser Phe Gly Asp Pro His Val Arg Ser
50 55 60

Phe His His His Phe His Thr Cys Arg Val Gln Gly Ala Trp Pro Leu
65 70 75 80

Leu Asp Asn Asp Phe Leu Phe Val Gln Ala Thr Ser Ser Pro Met Ala
85 90 95

Leu Gly Ala Asn Ala Thr Ala Thr Arg Lys Leu Thr Ile Ile Phe Lys
100 105 110

Asn Met Gln Glu Cys Ile Asp Gln Lys Val Tyr Gln Ala Glu Val Asp
115 120 125

Asn Leu Pro Val Ala Phe Glu Asp Gly Ser Ile Asn Gly Gly Asp Arg
130 135 140

Pro Gly Gly Ser Ser Leu Ser Ile Gln Thr Ala Asn Pro Gly Asn His
145 150 155 160

Val Glu Ile Gln Ala Ala Tyr Ile Gly Thr Thr Ile Ile Ile Arg Gln
165 170 175

Thr Ala Gly Gln Leu Ser Phe Ser Ile Lys Val Ala Glu Asp Val Ala
180 185 190

Met Ala Phe Ser Ala Glu Gln Asp Leu Gln Leu Cys Val Gly Gly Cys
195 200 205

Pro Pro Ser Gln Arg Leu Ser Arg Ser Glu Arg Asn Arg Arg Gly Ala
210 215 220

Ile Thr Ile Asp Thr Ala Arg Arg Leu Cys Lys Glu Gly Leu Pro Val
225 230 235 240

Glu Asp Ala Tyr Phe His Ser Cys Val Phe Asp Val Leu Ile Ser Gly
245 250 255

Asp Pro Asn Phe Thr Val Ala Ala Gln Ala Ala Leu Glu Asp Ala Arg
260 265 270

Ala Phe Leu Pro Asp Leu Glu Lys Leu His Leu Phe Pro Ser Asp Ala
275 280 285

Gly Val Pro Leu Ser Ser Ala Thr Leu Leu Ala Pro Leu Leu Ser Gly
290 295 300

Leu Phe Val Leu Trp Leu Cys Ile Gln
305 310



<210> 12

<211> 426 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Gly Glu Pro Gly Gin Ser Pro Ser Pro Arg Ser Ser His Gly Ser 
1 5 10 is 

Pro Pro Thr Leu Ser Thr Leu Thr Leu Leu Leu Leu Leu Cys Gly His 
20 25 30 

Ala His Ser Gin Cys Lys lie Leu Arg Cys Asn Ala Glu Tyr Val Ser 
35 40 45 

Ser Thr Leu Ser Leu Arg Gly Gly Gly Ser Ser Gly Ala Leu Arg Gly 
50 55 60 

Gly Gly Gly Gly Gly Arg Gly Gly Gly Val Gly Ser Gly Gly Leu Cys 
65 70 75 80 

Arg Ala Leu Arg Ser Tyr Ala Leu Cys Thr Arg Arg Thr Ala Arg Thr 
85 90 95 

Cys Arg Gly Asp Leu Ala Phe His Ser Ala Val His Gly He Glu Asp 
100 105 no 
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Leu Met lie Gin His Asn Cys Ser Arg Gin Gly Pro Thr Ala Pro Pro 
115 120 125 

Pro Pro Arg Gly Pro Ala Leu Pro Gly Ala Gly Ser Gly Leu Pro Ala 
130 135 140 

Pro Asp Pro Cys Asp Tyr Glu Gly Arg Phe Ser Arg Leu His Gly Arg 
145 150 155 160 

Pro Pro Gly Phe Leu His Cys Ala Ser Phe Gly Asp Pro His Val Arg 
165 170 175 

Ser Phe His His His Phe His Thr Cys Arg Val Gln Gly Ala Trp Pro
180 185 190

Leu Leu Asp Asn Asp Phe Leu Phe Val Gln Ala Thr Ser Ser Pro Met
195 200 205

Ala Leu Gly Ala Asn Ala Thr Ala Thr Arg Lys Leu Thr Ile Ile Phe
210 215 220

Lys Asn Met Gln Glu Cys Ile Asp Gln Lys Val Tyr Gln Ala Glu Val
225 230 235 240

Asp Asn Leu Pro Val Ala Phe Glu Asp Gly Ser Ile Asn Gly Gly Asp
245 250 255

Arg Pro Gly Gly Ser Ser Leu Ser Ile Gln Thr Ala Asn Pro Gly Asn
260 265 270

His Val Glu Ile Gln Ala Ala Tyr Ile Gly Thr Thr Ile Ile Ile Arg
275 280 285

Gln Thr Ala Gly Gln Leu Ser Phe Ser Ile Lys Val Ala Glu Asp Val
290 295 300

Ala Met Ala Phe Ser Ala Glu Gln Asp Leu Gln Leu Cys Val Gly Gly
305 310 315 320

Cys Pro Pro Ser Gln Arg Leu Ser Arg Ser Glu Arg Asn Arg Arg Gly
325 330 335

Ala Ile Thr Ile Asp Thr Ala Arg Arg Leu Cys Lys Glu Gly Leu Pro
340 345 350

Val Glu Asp Ala Tyr Phe His Ser Cys Val Phe Asp Val Leu Ile Ser
355 360 365

Gly Asp Pro Asn Phe Thr Val Ala Ala Gln Ala Ala Leu Glu Asp Ala
370 375 380

Arg Ala Phe Leu Pro Asp Leu Glu Lys Leu His Leu Phe Pro Ser Asp 
385 390 395 400 

Ala Gly Val Pro Leu Ser Ser Ala Thr Leu Leu Ala Pro Leu Leu Ser
405 410 415

Gly Leu Phe Val Leu Trp Leu Cys Ile Gln
420 425



<210> 13 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Polynucleotide replication primer 
<400> 13 

gctgcctaca ttggcacaac t 

<210> 14 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Polynucleotide replication primer 
<400> 14 

gctgcctaca ctggcacaac t



<210> 15

<211> 21

<212> DNA 

<213> Artificial 

<220> 

<223> Polynucleotide replication primer 
<400> 15 

tgtgttgggg ggtgccctcc a 

<210> 16 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Polynucleotide replication primer 
<400> 16 

tgtgttgggg tgtgccctcc a



<210> 17

<211> 21

<212> DNA

<213> Artificial 

<220> 

<223> Polynucleotide replication primer 
<400> 17 

tacttccatt cctgtgtctt t



<210> 18

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Polynucleotide replication primer 

<400> 18 

tacttccatt ctgtgtcttt 20 



<210> 19 

<211> 1919 

<212> DNA 

<213> Homo sapiens 

<400> 19 

tctatataga tattaataaa tctagagaga cagaaagcag actggtgatg gccagtctag 60 

atggctagat agatagacat ggatatagat atagatctct atatagatag aggtagatac 120 

agatatagat atatgcccta ttagttctgt tcctctagag aaccctaata cagtgaccgt 180 

atttggaatc ggtccttctg ttaatttcac ttggcaagta ctaaaagatg atgatctcag 240 

atatacctat ggctgcaaaa acatgacatg gctaaatccc ttggttgcag tatctctttt 300

cttttttaag gggggtgggg gggcgggtct cactgttgcc caggctggag tgcaatggcg 360

ttatcatagc tcactgcagc ctcaaactcc tgcgctcaag tgaccctcct gcctcagctc 420 

ccaaagtgct gagattttgc aatatttatg gtcacaagat tatgttattc cataaaagta 480 

tctttctgag gctaggcatg ttggttcaca cttgtaatcc cagcactctg agaggctgag 540 

atggaaggat tcattgaggc aaggagttca agaccagcct ggtcaacata gtgagacctc 600 

atctcggaag gaaggaagga aggagggagg gaggaaggga gggagtgaag gaaggaagga 660 

aggaaggaag gaaggaagga aggaaggaag gaaggaagga aaagtatatt tttgaatctt 720 

tttctatttc tccaactctt tctttagaag aattctattt ccattctttc ttcacctctt 780 

tgcctttgtt agccttctct ccaagcaaat cgggagcctt tattttttgt gtattcatga 840 

gggagaggaa gatgaattgc tgtacaaact aaagtaatga aaatggagta ggtaggagga 900 

tagacagctg caaggatctg agctggatag actgaacaaa ccctcatcct aagcaactca 960 

cagctcagat ttcttctctg gacagctggc ttttttcgtc cttctgaaat actctgcaaa 1020

gataggagag gggctatgaa ctacctctgc tatggatctt attcaaagtc agctacctcc 1080 

tagatactat ctgtagaacc taaatgtaat attcagcata gcagggatga acatggtaaa 1140 

tgaaaggtat ccaattgccc actgtaattt ttaaaggcca ggagctcaac attattgaaa 1200 

atgctggagg gctgcctgga gtaggcagtg accacagagt cacacaagct ggaattggat 1260 

atccaacttg tctgtcatat ttctctcctc cctccctgac ttggcactca atactccata 1320 

ttctttctaa tcctctaacc ctccccactc ccccaactcc cacaccctac ccccaccaac 1380 

gttcctggaa ttttggactt agctattttt aaaaccgtca actcagtagc cacctccctc 1440 

cctgctcagc tgtccagtac tctggccagc catatactcc cccttccccc cataccaaac 1500 

cttctctggt tccctgacct cagtgagaca gcagccggcc tggggacctg ggggagacac 1560 

ggaggacccc ctggctggag ctgacccaca gagtagggaa tcatggctgg agaattggat 1620 

agcagagtaa tgtttgacct ctggaaacag taagtcaaaa tgaaattgca attcctttaa 1680

taagctttta tattgaagtt agacttttat aaaattacaa acacctactt ggatgtctct 1740

cgtccaaatg ctgggatctc tccctaccaa ggtgccccaa tctccatttc tctttctgtc 1800

ttatttcttt ctggcctctg gcctctagct ttttgaagtt taattctctg tctctcctct 1860

ggcagtctta gccctctctt taccttatta cctcaagact cctgatgaag ttttagaag 1919



<210> 20

<211> 1020

<212> DNA 

<213> Mus musculus 



<400> 20 
tgtattgacg aggaggaggg attgttgcac acactagcgt aatggaaaag gtggagatag 60

aatagacagc tgcagagatc tgatccggac agacagaata aaccctcctc cgaaacaatt 120

ctgctctcgg gtttcttctc cagacagctg gcctttcggc ttctgaaata gttggcggag 180

atgggggagg gtctctgaac tacgcctgcc atccatcttc aaagccagct acctctacgt 240

accatgtgtg gaaactcagt ggcatcctca gtaaagagag gatgagaacg gtgagtgaca 300

ggcgtcacac aaatcaccac cagccgtctt acaggccggg agctcattac tgagaacgct 360

aaaggacctg agtggcaggt catacaagct gcagttggac atgggacttg gccatcgttc 420

tctctttagg cagtccctca cttggtaccc agcattccca tattccctct ttattttgct 480

catcattcct gctgccttag ctcccacacc ctactgccac caacgttcct ggaattttgg 540

acctagctat ttttaaaact gtcaactcag gaggcacctc cctcctcctc tcagctgtcc 600

agtgcttggg ccaaccatat actctccctg ccccctcccc ccacaccaaa gctcctctgg 660

ctctctgacc tcggtgagat tgcagccagt ccgggggatc ggggacagac atggagaagg 720

agatggagga ccccctggct ggagcagacc aacagaatag gcaactatgg ctggagaacc 780

gggtatcaga gtaatgcttg acctcgggaa acagtaagtc tagatgaaat ggcggttgct 840

ttgataagct tttgggtcga ggctagaatt tcataaagtt acagacatct gttctgaaaa 900

ctaagatctc tccttaccag ataccccaat cttcactttt ggaccgcctg ctcatacact 960

tattccaaag aagggttttg acaggagaaa gggagacaga cccctcccaa tatctgttcc 1020



<210> 21

<211> 1120

<212> DNA

<213> Rattus rattus 



<400> 21 
tctggcccca cttctctgtg gatcggtcat cttcaaaacc agttcagctt cccactgccc 60

ccaccacccc cgaaacccgg aagcttctgt tttttgtgtt tgtattgacg aggaggaggg 120

attgttgcac acactagcgt aatggaaaag gtggagatag aatagacagc tgcagagatc 180

tgatccggac agacagaata aaccctcctc cgaaacaatt ctgctctcgg gtttcttctc 240

cagacagctg gcctttcggc ttctgaaata gttggcggag atgggggagg gtctctgaac 300

tacgcctgcc atccatcttc aaagccagct acctctacgt accatgtgtg gaaactcagt 360

ggcatcctca gtaaagagag gatgagaacg gtgagtgaca ggcgtcacac aaatcaccac 420

cagccgtctt acaggccggg agctcattac tgagaacgct aaaggacctg agtggcaggt 480

catacaagct gcagttggac atgggacttg gccatcgttc tctctttagg cagtccctca 540

cttggtaccc agcattccca tattccctct ttattttgct catcattcct gctgccttag 600

ctcccacacc ctactgccac caacgttcct ggaattttgg acctagctat ttttaaaact 660

gtcaactcag gaggcacctc cctcctcctc tcagctgtcc agtgcttggg ccaaccatat 720

actctccctg ccccctcccc ccacaccaaa gctcctctgg ctctctgacc tcggtgagat 780

tgcagccagt ccgggggatc ggggacagac atggagaagg agatggagga ccccctggct 840

ggagcagacc aacagaatag gcaactatgg ctggagaacc gggtatcaga gtaatgcttg 900

acctcgggaa acagtaagtc tagatgaaat ggcggttgct ttgataagct tttgggtcga 960

ggctagaatt tcataaagtt acagacatct gttctgaaaa ctaagatctc tccttaccag 1020

ataccccaat cttcactttt ggaccgcctg ctcatacact tattccaaag aagggttttg 1080

acaggagaaa gggagacaga cccctcccaa tatctgttcc 1120



<210> 22

<211> 7265 

<212> DNA 

<213> Homo sapiens 

<400> 22 

tctatataga tattaataaa tctagagaga cagaaagcag actggtgatg gccagtctag 60 
atggctagat agatagacat ggatatagat atagatctct atatagatag aggtagatac 120 
agatatagat atatgcccta ttagttctgt tcctctagag aaccctaata cagtgaccgt 180
atttggaatc ggtccttctg ttaatttcac ttggcaagta ctaaaagatg atgatctcag 240
atatacctat ggctgcaaaa acatgacatg gctaaatccc ttggttgcag tatctctttt 300
cttttttaag gggggtgggg gggcgggtct cactgttgcc caggctggag tgcaatggcg 360
ttatcatagc tcactgcagc ctcaaactcc tgcgctcaag tgaccctcct gcctcagctc 420
ccaaagtgct gagattttgc aatatttatg gtcacaagat tatgttattc cataaaagta 480 
tctttctgag gctaggcatg ttggttcaca cttgtaatcc cagcactctg agaggctgag 540 
atggaaggat tcattgaggc aaggagttca agaccagcct ggtcaacata gtgagacctc 600 
atctcggaag gaaggaagga aggagggagg gaggaaggga gggagtgaag gaaggaagga 660 
aggaaggaag gaaggaagga aggaaggaag gaaggaagga aaagtatatt tttgaatctt 720 
tttctatttc tccaactctt tctttagaag aattctattt ccattctttc ttcacctctt 780
tgcctttgtt agccttctct ccaagcaaat cgggagcctt tattttttgt gtattcatga 840 
gggagaggaa gatgaattgc tgtacaaact aaagtaatga aaatggagta ggtaggagga 900
tagacagctg caaggatctg agctggatag actgaacaaa ccctcatcct aagcaactca 960 

cagctcagat ttcttctctg gacagctggc ttttttcgtc cttctgaaat actctgcaaa 1020 

gataggagag gggctatgaa ctacctctgc tatggatctt attcaaagtc agctacctcc 1080

tagatactat ctgtagaacc taaatgtaat attcagcata gcagggatga acatggtaaa 1140 

tgaaaggtat ccaattgccc actgtaattt ttaaaggcca ggagctcaac attattgaaa 1200

atgctggagg gctgcctgga gtaggcagtg accacagagt cacacaagct ggaattggat 1260 

atccaacttg tctgtcatat ttctctcctc cctccctgac ttggcactca atactccata 1320 

ttctttctaa tcctctaacc ctccccactc ccccaactcc cacaccctac ccccaccaac 1380

gttcctggaa ttttggactt agctattttt aaaaccgtca actcagtagc cacctccctc 1440 

cctgctcagc tgtccagtac tctggccagc catatactcc cccttccccc cataccaaac 1500 

cttctctggt tccctgacct cagtgagaca gcagccggcc tggggacctg ggggagacac 1560

ggaggacccc ctggctggag ctgacccaca gagtagggaa tcatggctgg agaattggat 1620

agcagagtaa tgtttgacct ctggaaacag taagtcaaaa tgaaattgca attcctttaa 1680

taagctttta tattgaagtt agacttttat aaaattacaa acacctactt ggatgtctct 1740

cgtccaaatg ctgggatctc tccctaccaa ggtgccccaa tctccatttc tctttctgtc 1800

ttatttcttt ctggcctctg gcctctagct ttttgaagtt taattctctg tctctcctct 1860

ggcagtctta gccctctctt taccttatta cctcaagact cctgatgaag ttttagaagg 1920

agttccctac gtcctctatt ctgtagtttt cttaccaagg ccaaatatga cctcagatga 1980

tgagtcactg atacccttct atcctgcccc cacttagcaa tgcccttcac attgagattc 2040 

caagcatggg ggctgctccc tgtaaatgat ttctccccac aactctagtc cctccattct 2100 

attctccctc ttgcaggact cttcccccaa tcatatcctt acccataaga taggggagtt 2160 

aggcaggagg gatttagccc ctctccaact cctgtcatca taaaagactg agaacttcag 2220

aatttgaaaa gaagagatta atggaaggag tgatatttgg gaaaatacaa gaactgttga 2280 

cttagaaaaa acaaatattg atttgcatgt ttggtttgca tcccattatt ccatgagaga 2340 

gggagattaa aattgcagct ctctagagct gatgaaaaga gattggtttc cttttcattt 2400

gaatactgat attctagacg ggatgggtat gccaccctta atccttcttg tgttctgaca 2460 

caaaggagga aaagaaatgt atgactccta gagggcatct cctcctaatg gagagggaca 2520 

aataagaagt atgtttctga aatattttca ggtcctaatt ttactagggt acccactagg 2580

attactggta tctgatctag ccccatgatt cctccatctt tgacatacct gctgtttggt 2640 

agctcagaat ggagcaatac agtggactct gcccccttga gttcactcaa ccttccctcc 2700 

acccccacct aggggtattg cacaagggcc ctaaaagtgg ccacaggaac agggcaaaga 2760 

ggcttaactg ccacacttat agtttgagga actccaatct ccccaaattc cagtctgttc 2820 

atccttttct tgatctcccc agattcactc cacattatcc ttaccaatct tcaattcttc 2880

tctctctcca tgtccagcca aatttctttt ttcagtcact tacagggctt ccggtcaaaa 2940

ttcactaggt aggagggtca tcagctggga agaaccggcg cctgggaaac ctggctggat 3000 

aggtatgggg gagccaggcc agtcccctag tcccaggtcc tcccatggca gtcccccaac 3060 

tctaagcact ctcactctcc tgctgctcct ctgtggacat ggtaaggaag ggccagggaa 3120

gggtttgggg aaatctagag ggtaggctgc tatgtagggg tgggcatgtg agcctgaatg 3180 

agtgaggaga gataggcgct gagagtcccg atcactcgcc ctgctctcaa atactaatat 3240 

tttatttccc gttcagtctg gggaaggcca ctggggaagc ccttggtcga caggcagaag 3300 

agatgtggca ggcttacaca cttttagtaa gacagccgag agaactaggg actagggggt 3360 

tgggggctgg ggaaggccct tagttaggtt ttaggaaggc tggaaacccc tgatgagatt 3420 

tggaagagtt atgagcaaac tacactccga tagagcagag gtctgaggac cgtctcacaa 3480 

tcctctccct tctgtcttta gctcattctc aatgcaagat cctccgctgc aatgctgagt 3540 

acgtatcgtc cactctgagc cttagaggtg ggggttcatc aggagcactt cgaggaggag 3600 

gaggaggagg ccggggtgga ggggtgggct ctggcggcct ctgtcgagcc ctccgctcct 3660 

atgcgctctg cactcggcgc accgcccgca cctgccgcgg ggacctcgcc ttccattcgg 3720 

cggtacatgg catcgaagac ctgatgatcc agcacaactg ctcccgccag ggccctacag 3780 

cccctccccc gccccggggc cccgcccttc caggcgcggg ctccggcctc cctgccccgg 3840 

acccttgtga ctatgaaggc cggttttccc ggctgcatgg tcgtcccccg gggttcttgc 3900 

attgcgcttc cttcggggac ccccatgtgc gcagcttcca ccatcacttt cacacatgcc 3960 

gtgtccaagg agcttggcct ctactggata atgacttcct ctttgtccaa gccaccagct 4020 

cccccatggc gttgggggcc aacgctaccg ccacccggaa ggtcaggcac tcaatcttcc 4080 

ttccgatcca cctcatgaga ttcttccacg ggcaccattc ctccccatcc ccactattca 4140 

acagcaatgc tccctaattc ccttttcttc ctcaacctct cccccatctc gaatcactcc 4200

cttctaccaa acacctggag ctgtaaatca cttccccttg atgggaattt gactcaaatg 4260

cagaaaacct tgaagagaca gtcggagagg gcggacctga ggagtttcag aagggaaact 4320 

tttccctctc ctaggaagtt gccacgatta agtagagagg gggttaagta gggatgaggt 4380

aatactggaa cataaatagg agaagggatc aaggattgag ggccatagta gtcctgcatc 4440 

tctacttgga tcagatctct aactatgtat gaggtctgat tggggggaag atgcactgaa 4500 

cccaaaatga actgttttcc ctcttgtcct cacagctcac catcatattt aagaacatgc 4560

aggaatgcat tgatcagaag gtgtatcagg ctgaggtgga taatcttcct gtagcctttg 4620 

aagatggttc tatcaatgga ggtgaccgac ctgggggatc cagtttgtcg attcaaactg 4680

ctaaccctgg gaaccatgtg gagatccaag ctgcctacat tggcacaact ataatcattc 4740 

ggcagacagc tgggcagctc tccttctcca tcaaggtagc agaggatgtg gccatggcct 4800 

tctcagctga acaggacctg cagctctgtg ttggggggtg ccctccaagt cagcgactct 4860 

ctcgatcaga gcgcaatcgt cggggagcta taaccattga tactgccaga cggctgtgca 4920 

aggaagggct tccagtggaa gatgcttact tccattcctg tgtctttgat gttttaattt 4980 

ctggtgatcc caactttacc gtggcagctc aggcagcact ggaggatgcc cgagccttcc 5040 

tgccagactt agagaagctg catctcttcc cctcagatgc tggggttcct ctttcctcag 5100 

caaccctctt agctccactc ctttctgggc tctttgttct gtggctttgc attcagtaag 5160 

gggaccatca gtcccattac tagtttggaa atgatttgga gatacagatt ggcatagaag 5220 

aatgtaaaga atcattaaag gaagcagggc ctaggagaca cgtgaaacaa tgacattatc 5280 

cagagtcaga tgaggctgca gtccagggtt gaaattatca cagaataagg attctgggca 5340

aggttactgc attccggatc tctgtggggc tcttcaccaa tttttccagc ctcatttata 5400 

gtaaacaaat tgttctaatc catttactgc agatttcacc cttataagtt tagaggtcat 5460 

gaaggtttta atgatcagta aagatttaag ggttgagatt tttaagaggc aagagctgaa 5520 

agcagaagac atgatcatta gccataagaa actcaaagga ggaagacata attagggaaa 5580 

gaagtctatt tgatgaatat gtgtgtgtaa ggtatgttct gctttcttga ttcaaaaatg 5640 

aagcaggcat tgtctagctc ttaggtgaag ggagtctctg cttttgaaga atggcacagg 5700 

taggacagaa gtatcatccc taccccctaa ctaatctgtt attaaagcta caaattcttc 5760 

acaccatcct ctgttgccta tgttgaatct ctttacagat gcttgaaatg gagtaaatgc 5820 

aatgtgttca ctccactgaa agagggctcg gaagtatcag atactgttgc tatctcaggg 5880 

agtttacagg ctattggaga gacaaaacca attcacatga aagagtgatg agtgtgtaat 5940 

tattcactaa atcctacagt atggtacatt cagatgggaa gatggtagat ttgaactaaa 6000 

gtaataagaa taataaaagg taacagagaa gatgggattt gaagtgagct ttgaagactg 6060 

ggtaagattc aaattgttaa cgatcttcca ggcaatgaaa accatctgga gttatgcatg 6120 

gattcatgat tcagcaagga aatgagcaaa actcaaatgc agtagacaag gagtaatggg 6180 

acagaaggtt agatgggcag aggccagatt atgaagcacc ttaaaaaggg ggtaaggggt 6240

ttaaatttga ttaatatcta acacttattg aaacttaaaa tctgccaggc aatgttttaa 6300

acacttttaa aacattgact taattctcat agctctctaa ggaaggtggt attcttatct 6360

ctatttttat ataaggaaac ttgcctccag tcacacagct aacaaattat ggaacttgcc 6420 

tccagtcaca cagttaacaa atggcagagc cagaaactga acctatgccg tttggatccc 6480

gaaacttaat ttttaatcac tatactatat tgtcaaggaa gcagagagcc attaaagatt 6540 

ctggttgtgg agctggtaaa gattctaaaa ggggagttgt ggtggtcaat gtcaccacaa 6600 

aagctactct gaggctgggc gcggtggctg acacctgtaa tccagcactt tggtaggcca 6660 

atgtgggtgg atcacttgag gccaggagtt cgagaccagc ctgggcaaca tggtaaaacc 6720 

ccatctctac taaaaataca agaaatttgc caggcatgat agtccatgcc tgtaatcctg 6780 

taatccaagc tactcaggag gcagaggcat gagaatcgag aattgcttga acccgggcca 6840 

ggaggcagag gttgcagtga gctgagacca cgccactgca ctccagcctg ggcaacagag 6900

tgagactatc aaacaaaaaa caactactct cttctcatat tctcatataa agccaaaaag 6960 

agagagttgg aaaaggaggg aaagcccaaa gttgaaggaa tctagtttgt agaagaaaga 7020

ctgagaagaa atgctgttct agatgagggt gctctaaagt aacaatgctg ctctgaacaa 7080 

aattatgaag gaggtaactt ttacatttaa tatcttccct gtttctggtc tcatcctccc 7140 

ctctcaggta ctccataacc gaggacctgt cctccctgcc ctaatcaagt actcaccatt 7200 

atcttccacc tctcctctca gtccctgaca cccgacaata ctcccctgaa caaatattac 7260 

agtag 7265



<210> 23

<211> 450 

<212> PRT 

<213> Homo sapiens 

<400> 23 

Met Gln Pro Pro Arg Glu Arg Leu Val Val Thr Gly Arg Ala Gly Trp
1 5 10 15

Met Gly Met Gly Arg Gly Ala Gly Arg Ser Ala Leu Gly Phe Trp Pro 
20 25 30 

Thr Leu Ala Phe Leu Leu Cys Ser Phe Pro Ala Ala Thr Ser Pro Cys 
35 40 45 

Lys Ile Leu Lys Cys Asn Ser Glu Phe Trp Ser Ala Thr Ser Gly Ser
50 55 60 

His Ala Pro Ala Ser Asp Asp Thr Pro Glu Phe Cys Ala Ala Leu Arg 
65 70 75 80 

Ser Tyr Ala Leu Cys Thr Arg Arg Thr Ala Arg Thr Cys Arg Gly Asp 
85 90 95 

Leu Ala Tyr His Ser Ala Val His Gly Ile Glu Asp Leu Met Ser Gln
100 105 110

His Asn Cys Ser Lys Asp Gly Pro Thr Ser Gln Pro Arg Leu Arg Thr
115 120 125

Leu Pro Pro Ala Gly Asp Ser Gln Glu Arg Ser Asp Ser Pro Glu Ile
130 135 140 



Cys His Tyr Glu Lys Ser Phe His Lys His Ser Ala Thr Pro Asn Tyr
145 150 155 160

Thr His Cys Gly Leu Phe Gly Asp Pro His Leu Arg Thr Phe Thr Asp
165 170 175

Arg Phe Gln Thr Cys Lys Val Gln Gly Ala Trp Pro Leu Ile Asp Asn
180 185 190

Asn Tyr Leu Asn Val Gln Ala Thr Asn Thr Pro Val Leu Pro Gly Ser
195 200 205

Ala Ala Thr Ala Thr Ser Lys Leu Thr Ile Ile Phe Lys Asn Phe Gln
210 215 220

Glu Cys Val Asp Gln Lys Val Tyr Gln Ala Glu Met Asp Glu Leu Pro
225 230 235 240

Ala Ala Phe Val Asp Gly Ser Lys Asn Gly Gly Asp Lys His Gly Ala
245 250 255

Asn Ser Leu Lys Ile Thr Glu Lys Val Ser Gly Gln His Val Glu Ile
260 265 270

Gln Ala Lys Tyr Ile Gly Thr Thr Ile Val Val Arg Gln Val Gly Arg
275 280 285

Tyr Leu Thr Phe Ala Val Arg Met Pro Glu Glu Val Val Asn Ala Val
290 295 300

Glu Asp Trp Asp Ser Gln Gly Leu Tyr Leu Cys Leu Arg Gly Cys Pro
305 310 315 320

Leu Asn Gln Gln Ile Asp Phe Gln Ala Phe His Thr Asn Ala Glu Gly
325 330 335 

Thr Gly Ala Arg Arg Leu Ala Ala Ala Ser Pro Ala Pro Thr Ala Pro 
340 345 350 

Glu Thr Phe Pro Tyr Glu Thr Ala Val Ala Lys Cys Lys Glu Lys Leu 
355 360 365 

Pro Val Glu Asp Leu Tyr Tyr Gln Ala Cys Val Phe Asp Leu Leu Thr
370 375 380 

Thr Gly Asp Val Asn Phe Thr Leu Ala Ala Tyr Tyr Ala Leu Glu Asp 
385 390 395 400 

Val Lys Met Leu His Ser Asn Lys Asp Lys Leu His Leu Tyr Glu Arg 
405 410 415 

Thr Arg Asp Leu Pro Gly Arg Ala Ala Ala Gly Leu Pro Leu Ala Pro 
420 425 430 

Arg Pro Leu Leu Gly Ala Leu Val Pro Leu Leu Ala Leu Leu Pro Val 
435 440 445 

Phe Cys
450



<210> 24

<211> 478

<212> PRT

<213> Homo sapiens

<400> 24

Met Ile Arg Lys Lys Arg Lys Arg Ser Ala Pro Pro Gly Pro Cys Arg
1 5 10 15

Ser His Gly Pro Arg Pro Ala Thr Ala Pro Ala Pro Pro Pro Ser Pro 
20 25 30 

Glu Pro Thr Arg Pro Ala Trp Thr Gly Met Gly Leu Arg Ala Ala Pro 
35 40 45 

Ser Ser Ala Ala Ala Ala Ala Ala Glu Val Glu Gln Arg Arg Ser Pro
50 55 60

Gly Leu Cys Pro Pro Pro Leu Glu Leu Leu Leu Leu Leu Leu Phe Ser
65 70 75 80

Leu Gly Leu Leu His Ala Gly Asp Cys Gln Gln Pro Ala Gln Cys Arg
85 90 95

Ile Gln Lys Cys Thr Thr Asp Phe Val Ser Leu Thr Ser His Leu Asn
100 105 110

Ser Ala Val Asp Gly Phe Asp Ser Glu Phe Cys Lys Ala Leu Arg Ala
115 120 125

Tyr Ala Gly Cys Thr Gln Arg Thr Ser Lys Ala Cys Arg Gly Asn Leu
130 135 140

Val Tyr His Ser Ala Val Leu Gly Ile Ser Asp Leu Met Ser Gln Arg
145 150 155 160 

Asn Cys Ser Lys Asp Gly Pro Thr Ser Ser Thr Asn Pro Glu Val Thr 
165 170 175 

His Asp Pro Cys Asn Tyr His Ser His Ala Gly Ala Arg Glu His Arg 
180 185 190 

Arg Gly Asp Gln Asn Pro Pro Ser Tyr Leu Phe Cys Gly Leu Phe Gly
195 200 205

Asp Pro His Leu Arg Thr Phe Lys Asp Asn Phe Gln Thr Cys Lys Val
210 215 220

Glu Gly Ala Trp Pro Leu Ile Asp Asn Asn Tyr Leu Ser Val Gln Val
225 230 235 240

Thr Asn Val Pro Val Val Pro Gly Ser Ser Ala Thr Ala Thr Asn Lys
245 250 255

Ile Thr Ile Ile Phe Lys Ala His His Glu Cys Thr Asp Gln Lys Val
260 265 270

Tyr Gin Ala Val Thr Asp Asp Leu Pro Ala Ala Phe Val Asp Gly Thr 
275 280 285 

Thr Ser Gly Gly Asp Ser Asp Ala Lys Ser Leu Arg Ile Val Glu Arg
290 295 300

Glu Ser Gly His Tyr Val Glu Met His Ala Arg Tyr Ile Gly Thr Thr
305 310 315 320

Val Phe Val Arg Gln Val Gly Arg Tyr Leu Thr Leu Ala Ile Arg Met
325 330 335

Pro Glu Asp Leu Ala Met Ser Tyr Glu Glu Ser Gln Asp Leu Gln Leu
340 345 350

Cys Val Asn Gly Cys Pro Leu Ser Glu Arg Ile Asp Asp Gly Gln Gly
355 360 365

Gln Val Ser Ala Ile Leu Gly His Ser Leu Pro Arg Thr Ser Leu Val
370 375 380

Gln Ala Trp Pro Gly Tyr Thr Leu Glu Thr Ala Asn Thr Gln Cys His
385 390 395 400

Glu Lys Met Pro Val Lys Asp Ile Tyr Phe Gln Ser Cys Val Phe Asp
405 410 415

Leu Leu Thr Thr Gly Asp Ala Asn Phe Thr Ala Ala Ala His Ser Ala
420 425 430

Leu Glu Asp Val Glu Ala Leu His Pro Arg Lys Glu Arg Trp His Ile
435 440 445

Phe Pro Ser Ser Gly Asn Gly Thr Pro Arg Gly Gly Ser Asp Leu Ser
450 455 460

Val Ser Leu Gly Leu Thr Cys Leu Ile Leu Ile Val Phe Leu
465 470 475



<210> 25

<211> 420

<212> PRT

<213> Mus musculus

<400> 25

Met Gly Gln Ser Pro Ser Pro Arg Ser Pro His Gly Ser Pro Pro Thr
1 5 10 15

Leu Ser Thr Leu Thr Leu Leu Leu Leu Leu Cys Gly Gln Ala His Ser
20 25 30

Gln Cys Lys Ile Leu Arg Cys Asn Ala Glu Tyr Val Ser Ser Thr Leu
35 40 45

Ser Leu Arg Gly Gly Gly Ser Pro Asp Thr Pro Arg Gly Gly Gly Arg 
50 55 60 

Gly Gly Leu Ala Ser Gly Gly Leu Cys Arg Ala Leu Arg Ser Tyr Ala 
65 70 75 80 

Leu Cys Thr Arg Arg Thr Ala Arg Thr Cys Arg Gly Asp Leu Ala Phe 
85 90 95 

His Ser Ala Val His Gly Ile Glu Asp Leu Met Ile Gln His Asn Cys
100 105 110

Ser Arg Gln Gly Pro Thr Ala Pro Pro Pro Ala Arg Gly Pro Ala Leu
115 120 125 

Pro Gly Ala Gly Pro Ala Pro Leu Thr Pro Asp Pro Cys Asp Tyr Glu 
130 135 140 

Ala Arg Phe Ser Arg Leu His Gly Arg Ala Pro Gly Phe Leu His Cys 
145 150 155 160 

Ala Ser Phe Gly Asp Pro His Val Arg Ser Phe His Asn Gln Phe His
165 170 175

Thr Cys Arg Val Gln Gly Ala Trp Pro Leu Leu Asp Asn Asp Phe Leu
180 185 190

Phe Val Gln Ala Thr Ser Ser Pro Val Ser Ser Gly Ala Asn Ala Thr
195 200 205

Thr Ile Arg Lys Ile Thr Ile Ile Phe Lys Asn Met Gln Glu Cys Ile
210 215 220

Asp Gln Lys Val Tyr Gln Ala Glu Val Asp Asn Leu Pro Ala Ala Phe
225 230 235 240

Glu Asp Gly Ser Ile Asn Gly Gly Asp Arg Pro Gly Gly Ser Ser Leu
245 250 255

Ser Ile Gln Thr Ala Asn Leu Gly Ser His Val Glu Ile Arg Ala Ala
260 265 270

Tyr Ile Gly Thr Thr Ile Ile Ile Arg Gln Thr Ala Gly Gln Leu Ser
275 280 285

Phe Ser Ile Arg Val Ala Glu Asp Val Ala Arg Ala Phe Ser Ala Glu
290 295 300

Gln Asp Leu Gln Leu Cys Val Gly Gly Cys Pro Pro Ser Gln Arg Leu
305 310 315 320

Ser Arg Ser Glu Arg Asn Arg Arg Gly Ala Ile Ala Ile Asp Thr Ala
325 330 335

Arg Arg Leu Cys Lys Glu Gly Leu Pro Val Glu Asp Ala Tyr Phe Gln
340 345 350

Ser Cys Val Phe Asp Val Ser Val Ser Gly Asp Pro Asn Phe Thr Val
355 360 365

Ala Ala Gln Thr Ala Leu Asp Asp Ala Arg Ile Phe Leu Thr Asp Leu
370 375 380

Glu Asn Leu His Leu Phe Pro Ser Asp Ala Gly Pro Pro Leu Ser Pro
385 390 395 400

Ala Ile Cys Leu Val Pro Leu Leu Ser Ala Leu Phe Val Leu Trp Leu
405 410 415

Cys Phe Ser Lys
420



<210> 26 

<211> 422 

<212> PRT 

<213> Rattus rattus 

<400> 26 

Met Gly Asp Arg Gly Arg Ser Pro Ser Leu Arg Ser Pro His Gly Ser 
1 5 10 15 

Pro Pro Thr Leu Ser Thr Leu Thr Leu Leu Leu Leu Leu Cys Gly Gln
20 25 30

Ala His Ser Gln Cys Lys Ile Leu Arg Cys Asn Ala Glu Tyr Val Ser
35 40 45

Phe Thr Leu Ser Leu Arg Gly Gly Gly Ser Pro Asp Thr Pro Arg Gly
50 55 60

Gly Gly Arg Gly Gly Pro Ala Ser Gly Gly Leu Cys Arg Ala Leu Arg
65 70 75 80

Ser Tyr Ala Leu Cys Thr Arg Arg Thr Ala Arg Thr Cys Arg Gly Asp
85 90 95

Leu Ala Phe His Ser Ala Val His Gly Ile Glu Asp Leu Met Ile Gln
100 105 110

His Asn Cys Ser Arg Gln Gly Pro Thr Ala Ser Pro Pro Ala Arg Gly
115 120 125

Pro Ala Leu Pro Gly Ala Gly Pro Ala Pro Leu Thr Pro Asp Pro Cys
130 135 140

Asp Tyr Glu Ala Arg Phe Ser Arg Leu His Gly Arg Thr Pro Gly Phe
145 150 155 160

Leu His Cys Ala Ser Phe Gly Asp Pro His Val Arg Ser Phe His Asn
165 170 175

His Phe His Thr Cys Arg Val Gln Gly Ala Trp Pro Leu Leu Asp Asn
180 185 190

Asp Phe Leu Phe Val Gln Ala Thr Ser Ser Pro Val Ala Ser Gly Ala
195 200 205

Asn Ala Thr Thr Ile Arg Lys Ile Thr Ile Ile Phe Lys Asn Met Gln
210 215 220

Glu Cys Ile Asp Gln Lys Val Tyr Gln Ala Glu Val Asp Asn Leu Pro
225 230 235 240 

Ala Ala Phe Glu Asp Gly Ser Val Asn Gly Gly Asp Arg Pro Gly Gly 
245 250 255 

Ser Ser Leu Ser Ile Gln Thr Ala Asn Leu Gly Ser His Val Glu Ile
260 265 270

Arg Ala Ala Tyr Ile Gly Thr Thr Ile Ile Val Arg Gln Thr Ala Gly
275 280 285

Gln Leu Ser Phe Ser Ile Arg Val Ala Glu Asp Val Ala Arg Ala Phe
290 295 300

Ser Ala Glu Gln Asp Leu Gln Leu Cys Val Gly Gly Cys Pro Pro Ser
305 310 315 320

Gln Arg Leu Ser Arg Ser Glu Arg Asn Arg Arg Gly Ala Ile Ala Ile
325 330 335

Asp Thr Ala Arg Arg Leu Cys Lys Glu Gly Leu Pro Val Glu Asp Ala
340 345 350

Tyr Phe Gln Ser Cys Val Phe Asp Val Ser Val Ser Gly Asp Pro Asn
355 360 365

Phe Thr Val Ala Ala Gln Ser Ala Leu Asp Asp Ala Arg Val Phe Leu
370 375 380

Thr Asp Leu Glu Asn Leu His Leu Phe Pro Val Asp Ala Gly Pro Pro
385 390 395 400

Leu Ser Pro Ala Thr Cys Leu Val Arg Leu Leu Ser Val Leu Phe Val
405 410 415

Leu Trp Phe Cys Ile Gln
420



<210> 27

<211> 366

<212> PRT

<213> Fugu

<400> 27

Ala Ser Cys Arg Ile Leu Arg Cys Asn Ser Asp Phe Val Ala Ala Thr
1 5 10 15

Leu Asp Leu Gly Ser Ser Ala Gly Ala Gly Gly Gly Ala Pro Leu Ser 
20 25 30 

Arg Glu Ala Ala Asn Ala Glu Tyr Cys Arg Ala Leu His Ser Tyr Ser
35 40 45

Thr Cys Thr Lys Arg Met Ala Arg Pro Cys Arg Gly Asp Leu Ala Tyr
50 55 60

His Ser Ala Val Gln Gly Ile Glu Asp Leu Leu Ile Gln Tyr Arg Cys
65 70 75 80

Pro Leu Ala Gly Pro Thr Ala Gln Pro Arg Pro Leu Pro Pro Leu Leu
85 90 95

Ser Gly Asp Val Cys Leu Tyr Asp Arg Arg Leu Ala Ala Ala Glu Ala
100 105 110

Pro Gln Pro Asp Tyr Leu His Cys Gly Val Phe Gly Asp Pro His Ile
115 120 125

Arg Thr Phe Asn Asn Asp Phe His Thr Cys Ala Val Gln Gly Ala Trp
130 135 140

Pro Leu Ile Asp Asn Asp Phe Leu Tyr Val Gln Ala Thr Ser Ser Pro
145 150 155 160

Thr Arg Arg Gly Thr Gln Ala Thr Met Leu Thr Lys Ile Thr Val Ile
165 170 175

Val Lys Ser Trp Arg His Cys Val Asp Gln Gln Leu Tyr Gln Ala Glu
180 185 190

Leu Asp Asp Val Pro Met Ala Phe Ala Asp Gly Ser Val Val Ser Gly
195 200 205

Glu Arg Arg Gly Gln His Thr Leu Ala Ile Thr Gln Ser Pro Gly Arg
210 215 220

His Ala Glu Ile Arg Ala Ala His Ile Ala Thr Val Ala Ser Gly Gln
225 230 235 240

Ser Gly Arg Ser Leu Ser Leu Ser Val Tyr Ser Pro Arg Ser Val Val
245 250 255

Glu Ala Phe Gly Pro Glu Gln Asp Leu Gln Leu Cys Met Trp Gly Cys
260 265 270

Pro Ala Ser Gln Lys Leu Ser Thr Pro Pro Pro Thr Ser Ser Thr Phe
275 280 285

Ser Ala Ala Val Leu Ala His Cys Asp Ala Leu Leu Pro Val Arg Asp
290 295 300

Val Tyr His Gln Ala Cys Ile Phe Asp Leu Ile Thr Ser Gly Asp Leu
305 310 315 320

Asn Ser Ser Gly Ala Ala Ile Ser Ala Leu Gln Asp Ala Gln Lys Leu
325 330 335

Ile Ser Asp Pro Lys Arg Val His Leu Leu Ser Pro Thr Ser Ala Ala
340 345 350

Gln Arg Glu Asp His Leu Cys Leu Leu Leu Leu Leu Leu Ser
355 360 365



<210> 28 

<211> 432 

<212> PRT 

<213> Chicken 

<400> 28 

Met Gly Arg Gly Ala Gly Ser Thr Ala Leu Gly Leu Phe Gln Ile Leu
1 5 10 15

Pro Val Phe Leu Cys Ile Phe Pro Pro Val Thr Ser Pro Cys Lys Ile
20 25 30 

Leu Lys Cys Asn Ser Glu Phe Trp Ala Ala Thr Ser Gly Ser His His 
35 40 45 

Leu Gly Ala Glu Glu Thr Pro Glu Phe Cys Thr Ala Leu Arg Ala Tyr 
50 55 60

Ala His Cys Thr Arg Arg Thr Ala Arg Thr Cys Arg Gly Asp Leu Ala 
65 70 75 80 

Tyr His Ser Ala Val His Gly Ile Asp Asp Leu Met Val Gln His Asn
85 90 95

Cys Ser Lys Asp Gly Pro Thr Ser Gln Pro Arg Leu Arg Thr Leu Pro
100 105 110

Pro Gly Asp Ser Gln Glu Arg Ser Asp Ser Pro Glu Ile Cys His Tyr
115 120 125 

Glu Lys Ser Phe His Lys His Ser Ala Ala Pro Asn Tyr Thr His Cys 
130 135 140 

Gly Leu Phe Gly Asp Pro His Leu Arg Thr Phe Thr Asp Thr Phe Gln
145 150 155 160

Thr Cys Lys Val Gln Gly Ala Trp Pro Leu Ile Asp Asn Asn Tyr Leu
165 170 175

Asn Val Gln Val Thr Asn Thr Pro Val Leu Pro Gly Ser Ser Ala Thr
180 185 190

Ala Thr Ser Lys Leu Thr Ile Ile Phe Lys Ser Phe Gln Glu Cys Val
195 200 205

Glu Gln Lys Val Tyr Gln Ala Glu Met Asp Glu Leu Pro Ala Ala Phe
210 215 220

Ala Asp Gly Ser Lys Asn Gly Gly Asp Lys His Gly Ala Asn Ser Leu
225 230 235 240

Lys Ile Thr Glu Lys Val Ser Gly Gln His Ile Glu Ile Gln Ala Lys
245 250 255

Tyr Ile Gly Thr Thr Ile Val Val Arg Gln Val Gly Arg Tyr Leu Thr
260 265 270

Phe Ala Val Arg Met Pro Glu Glu Val Val Asn Ala Val Glu Asp Arg
275 280 285

Asp Ser Gln Gly Leu Tyr Leu Cys Leu Arg Gly Cys Pro Leu Asn Gln
290 295 300

Gln Ile Asp Phe Gln Thr Phe Arg Leu Ala Gln Ala Ala Glu Gly Arg
305 310 315 320 

Ala Arg Arg Lys Gly Pro Ser Leu Pro Ala Pro Pro Glu Ala Phe Thr 
325 330 335 

Tyr Glu Ser Ala Thr Ala Lys Cys Arg Glu Lys Leu Pro Val Glu Asp 
340 345 350 

Leu Tyr Phe Gln Ser Cys Val Phe Asp Leu Leu Thr Thr Gly Asp Val
355 360 365 

Asn Phe Met Leu Ala Ala Tyr Tyr Ala Phe Glu Asp Val Lys Met Leu 
370 375 380 

His Ser Asn Lys Asp Lys Leu His Leu Tyr Glu Arg Thr Arg Ala Leu 
385 390 395 400 

Ala Pro Gly Asn Ala Ala Pro Ser Glu His Pro Trp Ala Leu Pro Ala 
405 410 415 

Leu Trp Val Ala Leu Leu Ser Leu Ser Gln Cys Trp Leu Gly Leu Leu
420 425 430



<210> 29

<211> 21

<212> DNA

<213> Artificial

<220>

<223> Polynucleotide replication primer

<400> 29

tccaagtcag cgactctctc g 21



<210> 30 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Polynucleotide replication primer 



<400> 30 

tccaagtcag tgactctctc g 21



<210> 31 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Fragment containing polymorphism 
<400> 31 

acctgccgcg gggacctcgc c 



<210> 32 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 

<400> 32 

acctgccgcg tggacctcgc c 21 



<210> 33 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 

<400> 33 

gcctgggaaa cctggctgga t 21



<210> 34

<211> 21

<212> DNA

<213> Artificial

<220>

<223> Fragment containing polymorphism 
<400> 34 

gcctgggaaa gctggctgga t 21 



<210> 35 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 

<400> 35 

tcccttctgt ctttagctca t 21 



<210> 36 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Fragment containing polymorphism 
<400> 36 

tcccttctgt gtttagctca t 21 

<210> 37 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 
<400> 37 

gaggaggagg ccggggtgga 20



<210> 38

<211> 23

<212> DNA

<213> Artificial

<220>

<223> Fragment containing polymorphism 



<400> 38 

gaggaggagg aggccggggt gga 23



<210> 39

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 

<400> 39 

gcctccctgc cccggaccct t 21 



<210> 40 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Fragment containing polymorphism 
<400> 40 

gcctccctgc gccggaccct t 21 



<210> 41 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Fragment containing polymorphism 
<400> 41 

atggtcgtcc cccggggttc t 21 



<210> 42 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 

<400> 42 

atggtcgtcc accggggttc t 21 



<210> 43 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Fragment containing polymorphism 
<400> 43 

cgtcccccgg ggttcttgca t 21 

<210> 44 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 
<400> 44 

cgtcccccgg cgttcttgca t 21 

<210> 45 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 
<400> 45 

gtccaaggag cttggcctct a 21 

<210> 46 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 
<400> 46 

gtccaaggag attggcctct a 21 

<210> 47 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 
<400> 47 

cccccatggc gttgggggcc a 21 

<210> 48 

<211> 21 

<212> DNA 

<213> Artificial 
<220> 

<223> Fragment containing polymorphism 
<400> 48 

cccccatggc tttgggggcc a 21 



<210> 49 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Fragment containing polymorphism 
<400> 49 

taagaacatg caggaatgca t 21 



<210> 50 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 

<400> 50 

taagaacatg aaggaatgca t 21 

<210> 51 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Fragment containing polymorphism 
<400> 51 

gccttctcag ctgaacagga c 21 



<210> 52 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 

<400> 52 

gccttctcag gtgaacagga c 21 



<210> 53 
<211> 21 
<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 
<400> 53 

agatgctggg gttcctcttt c 21 



<210> 54 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Fragment containing polymorphism 

<400> 54 

agatgctggg attcctcttt c 21 



<210> 55 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Forward replication primer 
<400> 55 

cacttgagcc caggaatttg 20 

<210> 56 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse replication primer 
<400> 56 

gactcactgc agccttgacc 20 



<210> 57 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<223> Forward replication primer 



<400> 57 

gtgtgctaca agtttgccga at 22 
<210> 58 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse replication primer 

<400> 58 

gcttgaaact gggagttgga 20 

<210> 59 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Forward replication primer 

<400> 59 

gggaaatggt cccataattc ct 22 

<210> 60 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse replication primer 

<400> 60 

cgccctgcca atatgttct 19 

<210> 61 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Forward replication primer 

<400> 61 

ggtacttagc ctcgaaatga ga 22 



<210> 62 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Reverse replication primer 
<400> 62 

gtgtcacaca actggttggt 20 



